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Applicant herewith submits to the United States Designated/Elected Office (DO/EO/US) the following items and other information: 

1 . [3 This is a FIRST submission of items concerning a filing under 35 U.S.C. 371. 

^This is a SECOND or SUBSEQUENT submission of items concerning a filing under 35 U.S.C. 371, 

his is an express request to begin national examination procedures (35 U.S.C. 371(0). Ihe submission must include 
items (5), (6), (9) and (21) indicated below. 

4- LJ The US has been elected by the expiration of 19 months from the priority date (Article 31). 

5. [3 A copy of the International Application as filed (35 U.S.C. 371(c)(2)) 

a. Q is attached hereto (required only if not communicated by the International Bureau). 
' b. Q has been communicated by the International Bureau. 

c. Q is not required, as the application was filed in the United States Receiving Office (RO/US). 

6. □ An English language translation of the International Application as filed (35 U.S.C. 371(c)(2)). 

a. Q is attached hereto. 

has been previously submitted under 35 U.S.C. 154(d)(4). 

7. Amendments to the claims of the International Aplication under PCT Article 19 (35 U.S.C. 371(c)(3)) 

a. n are attached hereto (required only if not communicated by the International Bureau). 

b. Q have been communicated by the International Bureau. 

c. n have not been made; however, the time limit for making such amendments has NOT expired. 

d. have not been made and will not be made. 

8. Q An English language translation of the amendments to the claims under PCT Article 19 (35 U.S.C. 371 (c)(3)). 

9. [U An oath or declaration of the inventor(s) (35 U.S.C. 371(c)(4)). 

An English lanugage translation of the annexes of the International Preliminary Examination Report under PCT 
Article 36 (35 U.S.C. 371(c)(5)). 

Items 11 to 20 below concern document(s) or information included: 

1.1 I An Information Disclosure Statement under 37 CFR 1.97 and 1.98. 

2. Q An assignment document for recording. A separate cover sheet in compliance with 37 CFR 3.28 and 3.3 1 is included. 

3. [3 A FIRST preliminary amendment. 

4. □ A SECOND or SUBSEQUENT preliminary amendment. 
5.1 I A substitute specification. 

6. 1 I A change of power of attorney and/or address letter. 

7. r~1 A computer-readable form of the sequence listing in accordance with PCT Rule 13ter.2 and 35 U.S.C. 1.821 - 1.825. 

8. [~| A second copy of the published international application under 35 U.S.C. 154(d)(4). 

A second copy of the English language translation of the international application under 35 U.S.C. 154(d)(4). 
20. im Other items or information: * 
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U.S. APPLIOATION NO. (iOknownrse^-Sa: tf^ 



INTERNATIONAL APPLICATION NO. 

PCT/USOO/22889 



ATTORN EY'S DOCKET NU MBER 

216019-67 



$1040.00 



.$890.00 



2 1 .[3 The following fees are submitted: 
BASIC NATIONAL FEE (37 CFR 1.492 (a) (1) - (5)): 
Neither international preliminary examination fee (37 CFR 1.482) 
nor international search fee (37 CFR 1.445(a)(2)) paid to USPTO 
and International Search Report not prepared by the EPO or JPO 

International preliminary examination fee (37 CFR 1.482) not paid to 
USPTO but International Search Report prepared by the EPO or JPO . . 

International preliminary examination fee (37 CFR 1.482) not paid to USPTO 

but international search fee (37 CFR 1, 445(a)(2)) paid to USPTO $740.00 

International preliminary examination fee (37 CFR 1.482) paid to USPTO 

but all claims did not satisfy provisions of PCTT Article 33(l)-(4) $710.00 

International preliminary examination fee (37 CFR 1 ,482) paid to USPTO 

and all claims satisfied provisions of PCT Article 33(l)-(4) $100.00 

ENTER APPROPRIATE BASIC FEE AMOUNT = 



CALCULATIONS PTO USE ONLY 



S 740.00 



Surcharge of $130.00 for furnishing the oath or declaration later than Q 20 | | 30 
months from the earliest claimed priority date (37 CFR 1 .492(e)). 



CLAIMS 



NUMBER FILED 



NUMBER EXTRA 



RATE 



$ 



Total claims 



43 -20 - 



23 



X $18.00 



$ 414,00 



independent claims 



24 



•3 = 



21 



X $84.00 



$1,764.00 



MULTIPLE DEPENDENT CLAIM(S) (if applicable) 



+ $280.00 



TOTAL OF ABOVE CALCULATIONS = 



$2,918,00 



m Applicant claims small entity status. See 37 CFR 1.27. The fees indicated above 
are reduced by 1/2. 



SUBTOTAL - 



$2,918.00 



Processing fee of $130.00 for furnishing the English translation later than 20 O 30 
months from the earliest claimed priority date (37 CFR 1.492(f)). 



$ 



TOTAL NATIONAL FEE = 



$2,918.00 



Fee for recording the enclosed assignment (37 CFR 1.21(h)). The assignment must be 
accompanied by an appropriate cover sheet (37 CFR 3.28, 3.31). $40.00 per property + 



$ 



TOTAL FEES ENCLOSED - 



^2.918.00 



Amount to be 
refunded: 



charged: 



a. [3 A check in the amount of $ ::>Q1 ft . QQ 

b. Please charge my Deposit Account No. 

A duplicate copy of this sheet is enclosed. 



. to cover the above fees is enclosed. 
in the amount of $ 



to cover the above fees. 



c. 13 l^he Commissioner is hereby authorized to charge any additional fees which may be required, or credit any 

overpayment to Deposit Account No. 50~0639 ^ duplicate copy of this sheet is enclosed. 

d. n Fees are to be charged to a credit card. WARNING: Information on this form may become public. Credit card 

information should not be Included on this form. Provide credit card information and authorization on PTO-2038. 



NOTE: Where an appropriate time limit under 37 CFR 1.494 or 1.495 has not 
1.137 (a) or (b)) must be filed and granted to restore the application to pendin 

SEND ALL CORRESPONDENCE TO: 

Brian M- Berliner 
O'Melveny & Myers LLP 
400 South Hope Street 
Los Angeles, CA 90071-2899 



to revive (37 CFR 




Brian M. Berliner 



NAME 

34,549 



REGISTRATION NUMBER 
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10/069025 

]C1ORec'dP0r/PTO \ 9 FEB 20021- 

PATENT 
216,019-67 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

Applicants: BARLOW et al. 
Serial No.: 

National Stage Filing Date: 

International Appln. No: PCT/USOO/22889 

International Filing Date: August 18, 2000 

Title: Gene Expression Modulated in Ataxia 
Telangiectasia Tunfiorgenesis 

PRELIMINARY AMENDMENT 

Box PCT 

Assistant Commissioner for Patents 
Washington, D C. 20231 

Sir- 
Preliminary to examination, please amend the above-identified patent application 
as follows: 



Art Unit: 



Examiner; 



LA1:961207.1 



International Application No. PCT/USOO/22889 
February 19, 2002 
Page 2 

IN THE SPECIFICATION: 

Please amend the specification as follows: 

Please replace the paragraph beginning on page 61, line 14, with the following 
rewritten paragraph: 

In general, double-stranded cDNA is generated from poly(A)-enriched 
cytoplasmic RNA extracted from the tissue samples of interest using an equimolar 
mixture of all 48 5'-biotinylated anchor primers of a set to initiate reverse transcription. 
One such suitable set is G-A-A-T-T-C-A-A-C-T-G-G-A-A-G-C-G-G-C-C-G-C-A-G-G-A- 
A-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-V-N-N (SEQ ID NO:24), where V is A, C or G 
and N is A, C, G or T. One member of this mixture of 48 anchor primers initiates 
synthesis at a fixed position at the 3' end of all copies of each mRNA species in the 
sample, thereby defining a 3* endpoint for each species, resulting in biotinylated double 
stranded cDNA. 

IN THE SEQUENCE LISTING: 

Also attached is an amended version of the entire Sequence Listing for this 
application, submitted in both written and electronic format. The amendments made to 
the sequence listing effect the following changes: (1) on page 5 of the sequence listing, 
on the 31^* line containing typed text, replace "<211> 49" with "<211> 48"; (2) on page 
5 of the sequence listing, on the 36^^ line containing typed text, change "<222> (48) . . 
(49)" to "<222> (47) . . (48)"; (3) on page 5 of the sequence listing on the 40^^ line 
containing typed text, change "<222> (47)" to "<222> (46)"; (4) on page 5 of the 
sequence listing on the 45^^ line containing typed text, change "gaattcaact ggaagcggcc 
cgcaggaatt tttttttttt ttttttvnn 49" to "gaattcaact ggaagcggcc gcaggaattt tttttttttt tttttvnn 
48". To effect the foregoing changes, please replace the sequence listing in its entirety 
with the amended sequence listing enclosed herein. We have submitted the entire 
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International Application No. PCT/USOO/22889 
February 19, 2002 
Page 3 

sequence listing in written and electronic format for replacement because the written 
and electronic sequence listing submitted herein are in the Patentin 3.1 format, and may 
differ from the Patentin 2.1 version originally submitted to the U.S. Receiving Office in 
the international stage of this application due to differences in the two software versions, 
which are primarily formatting in nature. The differences we noted were as follows: 

1) The line spacing for the sequences is double spaced for Patentin 3.1, 
whereas for Patentin 2.1 it was single spaced. Thus the overall document has more 
pages with the newer version of Patentin. 

2) In the filed application, blanks appeared for fields <140> and <141> (they are 
in the header information). With the Patentin 3.1 version, these field numbers do not 
appear at all. 

3) Notes for the primer sequences (SEQ ID NO: 24, 26, and 28) appear in a 
different order when Patentin 3.1 is used. For the sequence listing generated with 
Patentin 3.1, the Description line preceeds the misc_feature line. In the Patentin 2.1, 
this was reversed. 

4) Miscellaneous features that are 1 base pair long (see SEQ ID NO: 24) are 
listed with the same base pair number as the beginning and end nucleotide number (if 
Patentin 3.1 is used) rather than with one number (possible with Patentin 2.1). 

REMARKS 

The specification and sequence listing are amended as described above. It is 
noted that the amendments to the specification refer to page and line numbers as set 
forth in the originally filed English version of the application. No new matter has been 
added to the application by this preliminary amendment, and this submission does not 
include matter that goes beyond the disclosure in the international application as filed. 
The sequence listing recorded and submitted herewith in computer readable form is 
identical to the written sequence listing submitted herewith, and is substantially identical 
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International Application No. PCT/USOO/22889 
February 19. 2002 
Page 4 

to the sequence listing submitted previously (except for the amendment described 
above and version differences noted above). Accordingly, the Applicant respectfully 
requests an examination of the application on the merits in view of the foregoing 
amendments. 

These amendments result from a typographical error that was recently noticed, 
i.e., an extra "C" had been introduced into the Not I site of one set of anchor primers, 
SEQ ID NO:24. The amendments correct an obvious error, and no new matter is 
added. The text of the application makes it clear that the anchor primer contains a NoU 
site. One of ordinary skill would understand that the sequence of the Not I site is a 
palindrome. G-C-G-G-C-C-G-C. The typographical error, G-C-G-G-C-C-C-G-C, would 
be recognized as an obvious error because it is not a palindrome, and not the NoU 
recognition site sequence. See page 48 of the 1998/1999 New England Biolabs 
Catalog, attached as Exhibit 1 . 

Attached hereto is a marked-up version of the changes made to the specification 
by the current amendment. The attached page Is captioned "Version with marlcings to 
show changes made." 



O'MELVENY & IWIYERS LLP 

400 So. Hope Street 

Los Angeles, California 90071-2899 

Telephone: 213-430-6000 

Enclosures: (1) New England Biolabs Catalog Pages 

(2) Amended Sequence Listing (written) 

(3) Amended Sequence Listing (diskette. Patentin 3.1) 



Date: February 19, 2002 




BrtarrM. Berliner 
Attorney for Applicants 
Registration No. 34,549 
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VERSION WITH MARKINGS TO SHOW CHANGES MADE 

IN THE SPECIFICATION: 

The paragraph beginning on page 61, line 14 is amended as follows: 

In general, double-stranded cDNA is generated from poly(A)-enriched 
cytoplasmic RNA extracted from the tissue samples of interest using an equimolar 
mixture of all 48 5'-biotinylated anchor primers of a set to initiate reverse transcription. 
One such suitable set is G-A-A-T-T-C-A-A-C-T-G-G-A-A-G-C-G-G-C-C-[C-]G-C-A-G-G- 
A-A-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-V-N-N (SEQ ID NO:63), where V is A, C or G 
and N is A, C, G or T. One member of this mixture of 48 anchor primers initiates 
synthesis at a fixed position at the 3' end of all copies of each mRNA species in the 
sample, thereby defining a 3' endpoint for each species, resulting in biotinylated double 
stranded cDNA. 



LAI :961 207.1 



wo 01/12647 



PCT/USOO/22889 



SEQUENCE LISTING 



<110> 



Barlow, Carrolee 



Winrow, Christopher J 
Callahan, Marie Lei A 
Pankratz, Daniel G 

<12 0> Gene Expression Modulated in Ataixia Telangiectasia 
Tumorgene s i s 

<130> 99-529B 

<140> 
<141> 

<150> 60/149,868 

<151> 1999-08-19 

<160> 52 

<170> Patentin Ver. 2.1 

<210> 1 

<211> 406 

<212> DNA 

<213> Mus musculus 

<400> 1 

cggacgggga attcccaaaa acactgcacg aagttaagct gacagtacag aaggatcagg 60 

tgtgtgagtc ccagttccaa agttcttaca acagagctaa tgagatatgt gtgggagact 12 0 

caaagatcaa gggagcttcc tttgaggagg attctggagg cccgcttgtg tgtaaaagag 18 0 

cagctgca.gg catcgtctcc tacgggcaaa ctgatggatc agctccacaa gtcttcacaa 240 

gagttttggg ttttgtatcg tggataaaga aaacgatgaa acacagctaa ctacaagaag 300 

caaactagat cctgatctga ccagccatct tccccatagc tgagtccagg attgctctag 360 

gacagatggc agcaactgaa taaagaactt tctctgactc caaaaa 4 06 



<210> 2 

<211> 54 

<212> DNA 

<213> Mus musculus 



<210> 3 

<211> 101 

<212> DNA 

<213> Mus musculus 

<400> 3 

cggccgtgtg tgccttagga gtgggaaact ttgcafcttct ctctccttat ccttcttgta 60 
agacatccat ttaataaagt ctcatgctga gagagtaaaa a 101 



<210> 4 

<211> 258 

<212> DNA 

<213> Mus musculus 



<400> 2 

cggcagaggc tcctggcctg cacttttcct 



attaaagtca 



ctgacacaca aaaa 



54 



1 
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<400> 4 

cggctcgcag agtgaaggca cccagccatc aggaatgcag ccctgccagc caggcaccat 60 
gggtggcagc catcatgctt tcatgaccat tgggcctgct ggtctacctg gtcttagccc 120 
aggaagccac caggtaggtt agggtggtca gtgccgagtc tcctgccaga cacagttata 180 
cctgcctttc tgcactgctc caggcatgcc cttagagcat ggtgttttaa agctaaataa 24 0 
agtctctaac tiacaaaaa 258 



<210> 5 
<211> 116 
<212> DNA 

<213> Mus musculus 
<400> 5 

cgggctcagc actgtctgct cacgtcccca ctgttgcaac acctagtctg tttgcaagga 60 
ggacaggaca ggtcacatgg agctttatgt caafcaaagtc tttatcttgt caaaaa 116 



<210> 6 
<211> 68 
c212> DNA 

<213> Mus musculus 



<400> 6 

cggaacgggg gatgatgggg gattggaaga acttgaactt tttattaaaa tattgtaagt 6 0 
cctaaaaa 68 



<210> 7 
<211> 172 
<212> DNA 

<213> Mus musculus 
<400> 7 

cggaccgcca tgcctggtga gcatatccat cc tttgaaag atacactcat gtcctaaacg 60 
gagctgtgac tgagatttag tgttttcaat atgaacaaaa agcatttgcc tagcaaatta 12 0 
agtticaaatg aaggcttagt. cet:t:t:gaaga ataaagcat:a aaaccaticaa aa 172 



<210> 8 
<211> 151 
<212> DNA 

<213> Mus musculus 



<400> 8 

cggacgctgg agccttcctt aatccacacc 
tacactgatt acatttcagg aggacgtgtt 
ccttaataaa tattttgcat actttcaaaa 



tttgaagcca attctcaact ggttgccgtg 6 0 
ctatttatgt tgtgcctctg caggcaaagc 120 
a 151 



<210> 9 
<211> 300 
<212> DNA 

<213> Mus musculus 



<400> 9 

cggacggtgt accccgagga tcgccccagg 
agcaggagta gaaccccatt tcgcctgtgt 
gcaaaagcca acgcagcaaa agctctggga 
cgagccaagg aggcaagcca ggggacagct 
tcagaaaagc agactgaaga tacaactaaa 



tggagggaaa gatccaggac caggtcgcgc 60 

gagaaagatc gaatggagct actagaaata 120 

acagccaact tcgacttgcc agcaagtctc 180 

gtttccagca gtgggccaaa ggtggagcat 240 

aataccagtig aaaagtctiti'c tacacaaaaa 300 
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<210> 10 
<211> 43 
<212> DNA 

<213> Mus musculus 
<400> 10 

cggagtcccc tcctctgaat taaaatcaca ctgagcggaa aaa 43 



<210> 11 
<211> 400 
<212> DNA 

<213> Mus musculus 
<400> 11 

cggcatcgtc cttgttaagc actgctttcc 
attttatagt ttcactctaa tgaagtcctg 
tcccctgagt ggcagcagca ggaagccccg 
acacccagtg gcattctttt tactataacg 
aaatttgcaa cagacctcct aacttttgtt 
aatgaaaaat gttaaaaaga caactggatc 
tttgtgtgaa ttgatttagt aaaatgttaa 



agaccgtgtc tgacttagtc aacctagagg 6 0 
attaacacag agcagcggag cagcggtccc 12 0 
tttgtaatgt gcgttgcgct gctttaccgt 18 0 
gttttccccc ctctcagaag agaaatctta 24 0 
tacttttttt gtcttaacga tagatttgaa 30 0 
tgttgtccgt attatatttg caatctgaaa 360 
ac t ggaaaaa 400 



<210> 12 
<211> 219 
<212> DNA 

<213> Mus musculus 

<400> 12 

cggcctggac cacatcacgg aggaagaact 
gtttcagatt gccaggagca cactcgacgt 
gattcaaagt gcatgtgcgg cgtctggagt 
gaattatatg catattgcaa atgctcttga 



gaaacggttc aagtactttg ctttgactga 60 
cgcagatagg acagaattag ctgaccacct 12 0 
gaccaaggcc attaatattt tccagaagtt 180 
agagaaaaa 219 



<210> 13 
<211> 89 
<212> DNA 

<213> Mus musculus 
<400> 13 

cgggctcttt gcacagttct aacctataca acttggtctg ctcctcctac caacttattt 60 
aatagacact aaagtiaaaac gcccaaaaa 89 

<210> 14 
<211> 42 
<212> DNA 

<213> Mus musculus 
<400> 14 

cggct:atcag t.gctaacaga aaacatitgtia gaaggccaaa aa 42 



<210> 15 
<211> 247 
<212> DNA 

<213> Mus musculus 
<400> 15 



3 
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cggaccatcc ctgctctgtg tccatctact 

agcttcactg gacactgagg ggtcacagac 

tcctagactt tatcatcttt cagaggcggt 

ctttccactg caattaatct tggtcactcc 
ggaaaaa 



catccaatgg agtcattagc agtcaatcgc 60 
ttaggctccc tttgggtcac gtccagcgtg 120 
aggcagactg ttcacaaagg ctttctctag 180 
ctcaaataag tatatttttt ctagaaaagg 240 

247 



<210> 16 

<211> 260 

<212> DNA 

<213> Mus musculus 

<400> 16 

cggaccctga ttcgcatcat ggtgtctcgc 
gagtataagc ggatgtacgg caagtcactg 
gactatcgaa agattccact gaagatctgt 
cacctgtgcc ccctgctggc attggccagg 
ctatctaata accccaaaaa 



ggcgagcttg acctgctgga tatccgagca 60 
taccacgata tcacgggaga tacttctgga 120 
ggtggcaacg actgagctgt ggcagtggct 180 
aaacgtccca aagagcacac ttcggt.ttcc 240 

260 



<210> 17 

<211> 209 

<:212> DNA 

<213> Mus musculus 

<400> 17 

cggaccgtgc aaaggtagca taatcacttg 
aaacgagggt ccaactgtct cttatcttta 
ctgaaatata ataataagac gagaagaccc 
ttaatttatt aaacctaatg gcccaaaaa 



ttccttaatt agggactagc atgaacggct 60 
atcagtgaaa ttgacctttc agtgaagagg 120 
tatggggctt aaattatata acttatctat 180 

209 



<210> 18 

<211> 385 

<212> DNA 

<213> Mus musculus 

<400> 18 

cggs^gtggac agcacaagcc atggaagcgc 
gcactgcatc tttgccactt cgtgcagatt 
tgaccttaga catctcagcc tcctttgctg 
aagtgcatcg tttaaacata atagctcccg 
cagacgcttt gaaaacttgt tctttaagtg 
gatcactttg tagtcatgta aaacttaagt 
aataaacgga gcggtgagcc aaaaa 



agatggccca gtgagccctc agcgctgtgc 60 

gattcactgc ccctcaaagc atttgcatca 120 

gatcccactt gaagaagatg ctiagagcaag 180 

tgagatctgg gtctgtgtat tgaactaact 240 

gaagtgaaag acctgaagtt: tgggatttgt 300 

gctttgtgcc tctccacatg tggttactct 360 

385 



<210> 19 

<211> 83 

<212> DNA 

<213> Mus musculus 

<400> 19 

cggccgaggc gaggcgccgc gcggaaaacc gcggcgggtg ttgacgcgat gtgatttctg 60 
cccagtgctc tgaatgtcaa aaa 83 



<210> 20 

<211> 286 

<212> DNA 

<213> Mus musculus 



4 
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<400> 20 

<=^9^9999ccaa gacgaggagg tgctgatgat 
gataggggtc ccaggagagg catggatgat 
gaccgaggac cttggaggaa tgctgctgaa 
gaccgagggc cttggagaaa tatggatgat 
agacctggtc cttggagatc atttgtcgag 



gaacggtcat cctggcggaa tgctgatgat 60 
gataggggtc ccaggcgagg gctggatgat 120 
gataggattt ccaggagagg tgcagatgat 180 
gatagggttc ctagaagggg tgatgatgca 240 
ccaggtggat gaaaaa 286 



<210> 21 

<211> 124 

<212> DNA 

<213> Mus mus cuius 



<400> 21 

cggtaccaat gtatagctgt gttggagaaa 
acattctcat ctgtctctct ggtgaccgat 
aaaa 



tatggaagat aaactataca ggatatttgc 60 
aattcagttt gctccagctg cccaagccca 120 

124 



<210> 22 

<211> 68 

<212> DNA 

<213> Mus musculus 

<400> 22 

cggtagcggg aacaccgaga acgaagccag cctggctttc gaaaacgcct gggtttgctg 60 
gagaaaaa 68 



<210> 23 

<211> 160 

<212> DNA 

<213> Mus musculus 



<400> 23 

cggtgcagca ccgtgaggtc cagtaccacg 
agggcatcat ctataaaaag gggggtgtgg 
cctatgatgt gaagaggctg ctacacgtaa 



agtccgacac cttccgtggc tacttcaagc 60 
cctctgggat gaagcacgtg gagaccaaca 120 
aggggaaaaa 160 



<210> 24 

<211> 49 

<212> DNA 

<213> Artificial Sequence 



<220> 

<221> misc_f eature 

<222> (48) . . (49) 

<223> N stands for any nucleotide . 



<220> 

<221> misc_f eature 
<222> (47) 

<223> V stands for A, G, or C. 



<220> 

<223> Description of Artificial Sequence: synthetic primer 
<400> 24 

gaattcaact ggaagcggcc cgcaggaatt tttttttttt ttttttvnn 4 9 



5 
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<210> 25 
<211> 16 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: synthetic 
primer 

<400> 25 

aggtcgacgg tatcgg 

<210> 26 
<211> 16 
<212> DNA 

<213> Artificial Sequence 

<220> 

<2 21> misc_f eature 
<222> (16) 

<223> N stands for any nucleotide. 

<223> Description of Artificial Sequence: synthetic 
primer 

<400> 26 

ggtcgacggt atcggn 

<210> 27 
<211> 15 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: synthetic 
primer 

<400> 27 

gagctccacc gcggt 

<210> 28 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_feature 
<222> (13) . . (16) 

<223> N stands for any nucleotide. 

<223> Description of Artificial Sequence: synthetic 
primer 

<400> 28 

cgacggtatc ggnnnn 

<210> 29 
<21l> 16 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: synthetic 
primer 
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<400> 29 

cgacggtatc gggagc 



16 



<210> 30 
<211> 30 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Extended TOGA 
primer for BAR1_7 . 

<400> 30 

gatcgaatcc ggacggtgta ccccgaggat 3 0 

<210> 31 
<211> 30 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Extended TOGA 
primer for BAR1_8 , 



<210> 32 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Extended TOGA 
primer for BAR1_11. 

<400> 32 

gatcgaatcc ggccgtgtgt gccttaggag 30 

<210> 33 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Extended TOGA 



<400> 31 

gatcgaatcc gggctctttg cacagttcta 



30 



primer for BAR1_16. 



<400> 33 

gatcgaatcc ggaacggggg atgatggggg 



30 



<210> 34 
<211> 30 
<212> DNA 



<213> Artificial Sequence 



7 



WO 01/12647 



PCT/USOO/22889 



<220> 

<223> Description of Artificial Sequence: Extended TOGA 
primer for BAR1_20. 



<400> 34 

gatcgaatcc ggtaccaatg tatagctgtg 



30 



<210> 35 
<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Extended TOGA 
primer for BARl 21. 

<400> 35 

gatcgaatcc ggcagaggct cctggcctgc 30 

<210> 36 
<211> 30 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Extended TOGA 
primer for BAR1_22 . 



<:210> 37 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Extended TOGA 
primer for BAR1_24 . 

<400> 37 

gatcgaatcc gggctcagca ctgtctgctc 30 

<210> 38 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Extended TOGA 
primer for BAR1_27. 



<400> 36 

gatcgaatcc ggctatcagt gctaacagaa 



30 



<400> 38 

gatcgaatcc ggacggggaa ttcccaaaaa 



30 



<210> 39 
<211> 30 
<212> DNA 
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<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence; Extended TOGA 
primer for BAR1_28 . 

<400> 39 

gatcgaatcc ggctcgcaga gtgaaggcac 30 

<210> 40 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Extended TOGA 
primer for BAR1_2 9. 



<210> 41 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Extended TOGA 
primer for BAR1_34 . 

<400> 41 

gatcgaatcc ggaccatccc tgctctgtgt 30 

<210> 42 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Extended TOGA 
primer for BAR1_3 5. 



<210> 43 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Extended TOGA 
primer for BAR1_37- 

<400> 43 

gatcgaatcc ggaccgccat gcctggtgag 30 



<400> 40 

gatcgaatcc ggagtcccct cctctgaatt 



30 



<400> 42 

gatcgaatcc ggaccctgat tcgcatcatg 



30 
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<210> 44 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
c220> 

<223> Description of Artificial Sequence: Extended TOGA 
primer for BARl 38. 

<400> 44 

gatcgaatcc ggaccgtgca aaggtagcat 30 



<210> 45 

<211> 30 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Extended TOGA 
primer for BAR1_39. 

<400> 45 

gatcgaatcc ggacgctgga gccttcctta 30 



<210> 46 

<211> 30 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Extended TOGA 
primer for BAR1_41. 

<400> 46 

gatcgaatcc ggagtggaca gcacaagcca 3 0 



<210> 47 
<211> 30 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Extended TOGA 
primer for BAR1_49. 

<400> 47 

gatcgaatcc ggcatcgtcc ttgttaagca 30 



<210> 48 

<211> 30 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Extended TOGA 
primer for BAR1_50. 

<400> 48 
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gatcgaatcc ggccgaggcg aggcgccgcg 



30 



<210> 49 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Extended TOGA 
primer for BAR1_52 . 

<400> 49 

gatcgaatcc ggcctggacc acatcacgga 3 0 

<210> 50 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Extended TOGA 
primer for BAR1_62 . 



<210> 51 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Extended TOGA 
primer for BAR1_67. 

<400> 51 

gatcgaatcc ggtagcggga acaccgagaa 30 

<:210> 52 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Extended TOGA 
primer for BAR1_69. 



<400> 50 

gatcgaatcc gggggccaag acgaggaggt 



30 



<400> 52 

gatcgaatcc ggtgcagcac cgtgaggtcc 



30 
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GENE EXPRESSION M ODULATED IN_ 
ATAXIA 1 ELANGiECTTLSlA TUMORGENESIS ^^ 

BACKGROUND OF THE INVENTION 
Ataxia telangiectasia (AT) is a human autosomal recessive multisystem disorder 
comprising progressive cerebellar ataxia with onset in infancy, progressive oculocutaneous 
telangiectasia, and unusual susceptibility to progressive bronchopulmonary disease and to 
lymphoreticular neoplasia. See, generally, Boder, E., (1987) Ataxia-Telangiectasia, pp. 95- 
1 17 in Gomez, M.R. and Adams, R.D., Eds., Neurocutaneous Disease. A Practical Approach . 
Butterworths, Boston. In addition, AT generally includes other progressive neurologic 
degenerations such as choreoathetosis and oculomotor dysfunction, recurrent sinopulmonary 
infections secondary to immunodeficiency, lymphoreticular malignancies, growth retardation, 
incomplete sexual maturation, endocrine abnormalities, and premature aging of the skin and 
hair. Barlow, C. et al., (1996)y4r/w-deficient mice: a paradigm of ataxia telangiectasia. Cell, 
86:159-171. The disease is progressive, and death generally occurs by the second or third 
decade of life due to neurologic deterioration or lymphoreticular malignancies, which occur in 
about 10% - 15% of patients. Taylor, A.M. et al. (1996) Leukemia and lymphoma in ataxia 
telangiectasia. Blood, 87:423-438. No effective treatments have been found to alter the course 
of the disease. The frequency of AT in the United States and Britain has been estimated to be 
between 1:40,000 and 1:100,000, resulting in a carrier frequency of 0.5% - 1.0% (Barlow et al., 
1996). Heterozygous carriers may also have a predisposition to cancer, particularly breast 
cancer. 

AT has been mapped to human chromosome position 1 lq22-23, and the responsible 
gene (designated ATM, "AT mutated") has been has been defined by positional cloning. The 
mouse homolog, Atm^ has also been identified, Atm shows an 84% amino acid identity and a 
9 1 % similarity with A TM. Sequence comparisons revealed that both ATM and Atm are 
members of a family of genes involved at several stages in double-stranded DNA breakage 
repair and cell cycle control. 

The transformation of a normal cell to a tumor cell appears to depend in part on 
mutations in genes that normally control the cell cycle. Cell cycle checkpoints are believed to 
play a major role in maintaining the integrity of the genome. Defects in these control points 



i0/0b9025 



1 



wo 01/12647 PCT/USOO/22889 
may contribute to increased incidence of genomic alterations such as deletions, translocations, 
and amplifications that are common during the evolution of a normal cell to a cancer cell. 



ATM is a key protein for managing cell cycle perturbations in response to DNA damage 
5 and plays a role in genetic stability and cancer susceptibility. Inactivation of normal ATM 

function could thus result in inappropriate replication of damaged DNA, thus contributing to the 
malignant transformation of cells. Available evidence indicates that ATM may be a critical 
regulator of many important cellular processes and thus has potential implications for cancer in 
the general population. Notably, many studies have indicated that loss of ATM function is 

10 related to cancer predisposition in AT heterozygotes. In particular, it has been foimd that 

women heterozygous for one mutated allele of ^rAf have a three to fivefold increase in breast 
cancer risk. Importantly, it has also been recently established that patients with loss of 
heterozygosity at 1 lq22 (the chromosomal location of ATM) and specifically have loss of ATM 
function, develop particularly aggressive forms of both B and T cell lymphomas. These studies 

1 5 suggest that ATM is a tumor suppressor gene whose inactivation is a key event in the 

development of many forms of cancer, particularly in T-cell prolymphocytic leukemias and 
aggressive forms of B cell chronic lymphocytic leukemia in patients who do not have AT. 

SUMMARY OF THE INVENTION 

20 The PCR-based Total Gene Expression Analysis (TOGA™) differential display system 

has been used in studies to examine how in ataxia telangiectasia tumorgenesis is regulated. 
Molecules have been identified that correspond to genes that are regulated in ataxia 
telangiectasia tumorgenesis. Such molecules are useful in the therapy of cancer, including 
treatment, prevention, and amelioration of cancer, and in the diagnosis of cancer. 

25 

The present invention provides novel polynucleotides and the encoded polypeptides. 
Moreover, the present invention relates to vectors, host cells, antibodies, and recombinant 
methods for producing the polynucleotides and the polypeptides. Also provided are diagnostic 
methods for detecting disorders related to the polypeptides and the polynucleotides encoding 
30 them, and therapeutic methods for treating such disorders. The invention further relates to 
screening methods for identifying binding partners of the polypeptides. 



In particular, it has been found that because of the close correspondence between human 
ATM and mouse Atm genes, the tumors that develop in Atm-d^f\c\cnt mice provide the ability to 

2 
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specifically characterize the genes involved in cancer formation as a model of particular forms 
of human hematopoietic cancer. Cancer cell lines established from several T-cell lymphomas 
from multiple different Atm deficient animals are usefijl for the study of human cancers, as well 
as for the development of suitable diagnostic and therapeutic compositions and methods. An 
5 extensive cellular analysis of these tumors has been perfomied and show that they invariably 
arise at a stage of development when the TCRa undergoes rearrangement. It has been 
specifically found that the translocations occur at the TCRa locus in ten of ten tumor cell lines 
established fi*om these mice. Therefore, it is likely that the tumors arise due to aberrant V(D)J 
rearrangements of the TCRa locus as seen in human cancers. In addition, it has been 
10 specifically found that regions of chromosome 12 are also involved. These regions are syntenic 
to the regions on human chromosomes known to be fi-equently mutated in human hematopoietic 
cancer. The Atm deficient tumor cell lines disclosed herein are usefiil for characterizing the 
genes involved in the multi-step process of cancer formation. 

15 One embodiment of the invention provides an isolated nucleic acid molecule comprising 

a polynucleotide chosen fi-om the group consisting of SEQ ID NO: 1, SEQ ID NO:2, SEQ ID 
NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID 
NO;9, SEQ ID NO:10, SEQ ID NO:l 1, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ 
ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, 

20 SEQ ID NO:21, SEQ ID NO:22, and SEQ ID NO:23. Further provided is a an isolated nucleic 
acid molecule comprising a polynucleotide at least 95% identical to the isolated nucleic acid 
molecule comprising a polynucleotide chosen fi"om the group consisting of SEQ ID NO:l, SEQ 
ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID 
NO:8, SEQ ID NO:9, SEQ ID NOilO, SEQ ID NO:ll, SEQ ID NO:12, SEQ ID NO:13, SEQ 

25 ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, 
SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, and SEQ ID NO:23. Also provided is an 
isolated nucleic acid molecule at least ten bases in length that is hybridizable to an isolated 
nucleic acid molecule selected fi^om the group consisting of SEQ ID NO:l, SEQ ID NO:2, SEQ 
ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID 

30 NO:9, SEQ ID NOrlO, SEQ ID NO:l 1, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ 
ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, 
SEQ ID NO:21, SEQ ID NO:22, and SEQ ID NO:23 under stringent conditions. Preferably, an 
isolated nucleic acid molecule of the invention encodes a polypeptide epitope. A 
polynucleotide of the invention can fiirther comprise sequential nucleotide deletions fi-om either 
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the 5 '-terminus or the 3 '-terminus. A polynucleotide of the invention can be present in a 
recombinant vector and in a recombinant host cell. 



Another embodiment of the invention provides an isolated polypeptide encoded by a 
5 polynucleotide chosen from the group consisting of SEQ ID NO: 1 , SEQ ID NO:2, SEQ ID 
NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID 
NO:9, SEQ ID NO:10, SEQ ID NO:ll, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO: 14, SEQ 
ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, 
SEQ ID NO:21, SEQ ID NO:22, and SEQ ID NO:23. Further provided is an isolated nucleic 

10 acid molecule encoding these polypeptides. Also provided is an isolated nucleic acid molecule 
encoding a fragment of a polypeptide encoded by a polynucleotide chosen from the group 
consisting of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ 
ID NO:6, SEQ ID NO:7, SEQ ID NO: 8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ 
ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, 

15 SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, and SEQ ID 
NO:23, Additionally provided is an isolated nucleic acid encoding a species homologue of a 
polypeptide of the invention. Preferably, a polypeptide of the invention has biological activity. 
A polypeptide of the invention can comprise sequential amino acid deletions from either the C- 
terminus or the N-terminus. 

20 

Still another embodiment of the invention provides an isolated antibody that binds 
specifically to an isolated polypeptide of the invention. The antibody can be a monoclonal 
antibody or a polyclonal antibody. 

25 Yet another embodiment of the invention provides an isolated polypeptide. The 

polypeptide is produced by culturing a recombinant host cell that comprises a polynucleotide of 
the invention under conditions such that the polj^eptide is expressed and isolating the 
polypeptide. 

30 Another embodiment of the invention provides a method for preventing, treating, 

modulating, or ameliorating a medical condition, such as ataxia telangiectasia, comprising 
administering to a mammalian subject a therapeutically effective amount of a polypeptide of the 
invention or a polynucleotide of the invention. 
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A further embodiment of the invention provides an isolated antibody that binds 
specifically to the isolated polypeptide of the invention. A preferred embodiment of the 
invention provides a method for preventing, treating, modulating, or ameliorating a medical 
condition, such as ataxia telcuigiectasia, comprising administering to a mammalian subject a 
5 therapeutically effective amount of the antibody. 



An additional embodiment of the invention provides a method of diagnosing a 
pathological condition or a susceptibility to a pathological condition in a subject. The method 
comprises determining the presence or absence of a mutation in a polynucleotide of the 
10 invention. A pathological condition or a susceptibility to a pathological condition, such as 
ataxia telangiectasia is diagnosed based on the presence or absence of the mutation. 

Even another embodiment of the invention provides a method of diagnosing a 
pathological condition or a susceptibility to a pathological condition, such as ataxia 

1 5 telangiectasia in a subject. The method comprises detecting an alteration in expression of a 
polypeptide encoded by the polynucleotide of the invention, wherein the presence of an 
alteration in expression of the polypeptide is indicative of the pathological condition or 
susceptibility to the pathological condition. The alteration in expression can be an increase in 
the amount of expression or a decrease in the amount of expression. In a preferred embodiment 

20 a first biological sample is obtained from a patient suspected of having ataxia telangiectasia and 
a second sample fi"om a suitable comparable control source is obtained. The amount of at least 
one polypeptide encoded by a polynucleotide of the invention is determined in the first and 
second sample. The amount of the polypeptide in the first and second samples is determined. 
A patient is diagnosed as having ataxia telangiectasia if the amount of the polypeptide in the 

25 first sample is greater than or less than the amount of the polypeptide in the second sample. 



Yet another embodiment of the invention provides a method for identifying a binding 
partner to a polypeptide of the invention. The polypeptide is contacted v^ith a binding partner. 
Whether the binding partner effects an activity of the polypeptide is determined. 

30 

Even another embodiment of the invention provides a method of identifying an activity 
of an expressed polypeptide in a biological assay. A polypeptide of the invention is expressed 
in a cell. The expressed polypeptide is isolated and tested for an activity in a biological assay. 
The activity of the expressed polypeptide is identified based on the test results. 

35 



5 



wo 01/12647 PCT/USOO/22889 
Another embodiment of the invention provides a substantially pure isolated DNA 
molecule suitable for use as a probe for genes regulated in ataxia telangiectasia. The DNA 
molecule is chosen from the group consisting of the DNA molecules identified in SEQ ID 
NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID 
5 NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:l 1, SEQ ID NO:12, SEQ ID 
NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ 
ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, and SEQ ID NO:23. 

Still another embodiment of the invention provides a kit for detecting the presence of a 
10 polypeptide of the invention in a mammaliem tissue sample comprising a first antibody which 
immunoreacts with a manmialian protein encoded by a gene corresponding to a polynucleotide 
or polypeptide of the invention in an amount sufficient for at least one assay and suitable 
packaging material. The kit can fiirther comprise a labeled or unlabeled second antibody that 
binds to the first antibody. A label can comprise enzymes, radioisotopes, fluorescent 
15 compounds, colloidal metals, chemiluminescent compounds, phosphorescent compounds, or 
bioluminescent compounds. 

Yet another embodiment of the invention provides a kit for detecting the presence of a 
genes encoding an protein comprising a polynucleotide of the invention, or fragment thereof 
20 having at least 10 contiguous bases, in an amount sufficient for at least one assay, and suitable 
packaging material. 

Even another embodiment of the invention provides a method for detecting the presence 
of a nucleic acid encoding a protein in a mammalian tissue sample. A polynucleotide of the 
25 invention or fragment thereof having at least 10 contiguous bases is hybridized with the nucleic 
acid of the sample. The presence of the hybridization product is detected. 



30 



BRIEF DESCRIPTION OF THE DRAWINGS 

These and other features, aspects, and advantages of the present invention will become 
better understood with reference to the following description, appended claims, and 
accompanying drawing where: 
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Figure 1 is a graphical representation of the results of TOGA analysis using a 5' PGR 
primer with parsing bases GAGG, showing PGR products produced from mRNA extracted 
from cell lines AT-4 (Fig. lA), AT-? (Fig. IB), AT-12 (Fig. IG), and AT-13 (Fig. ID); the 
arrow indicates the vertical cursor connecting the position of peaks of length 197 base pairs; 

5 

Figure 2 is a graphical representation of the results of Northern Blot analysis using 
radiolabeled probe derived from a candidate gene (U88908) corresponding to GAGG 197, 
showing, hybridization with a band of mRNA at 4.2 kb that is expressed at various levels in cell 
lines AT-4, AT-7, AT-10, AT-1 1, AT-13; APT-3, P53-1, 101-7, 292-3, and thymus, but not 
10 detectable in cell line AT-12; the image below shows the methylene blue stained gel for 
quantification of mRNA loading; 

Figure 3 is a graphical representation of the results of TOGA analysis using a 5* FOR 
primer with parsing bases GCTG, showing PGR products produced from mRNA extracted from 
15 cell lines AT-4 (Fig. 3A), AT-7 (Fig. 3B), AT-12 (Fig. 3G), and AT-13 (Fig. 3D); the arrow 
indicates the vertical cursor connecting the position of peaks of length 345 base pairs; 

Figure 4 is a graphical representation of the results of Northern Blot analysis using 
radiolabeled probe derived from a candidate gene (Z31664) corresponding to GGTG 345, 
20 showing, hybridization with bands of mRNA at 4.0 kb and 3.7 kb that are expressed at various 
levels in cell lines AT-4, AT-1 1, AT-12, AT-13, 292-3, and polyA (pA) thymus mRNA, but not 
detectable in cell lines AT-7, AT-10, APT-3, P53-1, 101-7; the image below shows the 
methylene blue stained gel for quantification of mRNA loading; 

25 Figure 5 is a graphical representation of the results of TOGA analysis using a 5* PGR 

primer with parsing bases AGAT, showing PGR products produced from mRNA extracted from 
cell lines AT.4 (Fig. 5A), AT-7 (Fig. 5B), AT-12 (Fig. 5G), and AT-13 (Fig. 5D); the arrow 
indicates the vertical cursor connecting the position of peaks of length 367 base pairs; 

30 Figure 6 is a graphical representation of the results of Northem Blot analysis using 

radiolabeled probe derived from a candidate gene (U38252) corresponding to AGAT 367, 
showing, hybridization with a band of mRNA at 4.1 kb that is expressed at various levels in cell 
lines AT-7, AT-10, AT-12, AT-13 and APT-3, but not detectable in cell lines AT-4, AT-1 1, 
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P53-1, 101-7, 292-3 and polyA (pA) thymus mRNA; the image below shows the methylene 
blue stained gel for quantification of mRNA loading; 



Figure 7 is a graphical representation of the results of TOGA analysis using a 5* PGR 
5 primer with parsing bases ACGG, showing PGR products produced firom mRNA extracted 
from cell lines AT-4 (Fig. 7A), AT-7 (Fig. 7B), AT-12 (Fig. 7G), and AT-13 (Fig. 7D); the 
arrow indicates the vertical cursor connecting the position of peaks of length 458 base pairs; 

Figure 8 is a graphical representation of the results of Northem Blot analysis using 
10 radiolabeled probe derived fi-om a candidate gene (XI 2822) corresponding to AGGG 458, 

showing, hybridization with bands of mRNA at 1.7 kb and 1.1 kb that are expressed at various 
levels in cell lines AT-7 and AT- 10, but not detectable in cell lines AT-4, AT-12, AT-1 1, AT- 
13; APT-3, P53-1, 101-7, and 292-3, and polyA (pA) thymus (only 1.1 kb) mRNA; the image 
below shows the methylene blue stained gel for quantification of mRNA loading; 

15 

Figure 9 is a graphical representation of the results of TOGA analysis using a 5' PGR 
primer with parsing bases CCGT, showing PGR products produced from mRNA extracted from 
cell lines AT-4 (Fig. 9 A), AT-7 (Fig. 9B), AT-12 (Fig. 9C), and AT-13 (Fig. 9D); the arrow 
indicates the vertical cursor connecting the position of peaks of length 151 base pairs; and 

20 

Figure 10 is a graphical representation of a more detailed analysis of the 151 b.p. PGR 
product indicated in Figure 9, using the extended TOGA primer G-A-T-G-G-A-A-T-G-G-G-G- 
G-G-G-T-G-T-G-T-G-G-G-T-T-A-G-G-A-G (SEQ ID NO:32). 

25 

DETAILED DESGRIPTION OF THE PREFERRED EMBODIMENTS 

Definitions 

30 The following definitions are provided to facilitate understanding of certain terms used 

throughout this specification. 

In the present invention, "isolated" refers to material removed from its original 

environment (e.g., the natural environment if it is naturally occurring), and thus is altered "by 

35 the hand of man" from its natural state. For example, an isolated polynucleotide could be part 

of a vector or a composition of matter, or could be contained within a cell, and still be 

8 
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"isolated" because that vector, composition of matter, or particular cell is not the original 
environment of the polynucleotide. 



In the present invention, a "secreted" protein refers to those proteins capable of being 
5 directed to the ER, secretory vesicles, or the extracellular space as a result of a signal sequence, 
as well as those proteins released into the extracellular space without necessarily containing a 
signal sequence. If the secreted protein is released into the extracellular space, the secreted 
protein can undergo extracellular processing to produce a "mature" protein. Release into the 
extracellular space can occur by many mechanisms, including exocytosis and proteolytic 
10 cleavage. 

As used herein, a "polynucleotide" refers to a molecule having a nucleic acid sequence 
shown in SEQ ID NOs: 1 -23. For example, the polynucleotide can contain all or part of the 
nucleotide sequence of the full length cDNA sequence, including the 5' and 3' untranslated 
1 5 sequences, the coding region, with or without the signal sequence, the secreted protein coding 
region, as well as fragments, epitopes, domains, and variants of the nucleic acid sequence. 
Moreover, as used herein, a "polypeptide" refers to a molecule having the translated amino acid 
sequence generated from the polynucleotide as broadly defined. 

20 A "polynucleotide" of the present invention also includes those polynucleotides capable 

of hybridizing, under stringent hybridization conditions, to sequences contained in SEQ ID 
NOs: 1-23, or the complement thereof, or the cDNA. "Stringent hybridization conditions" refers 
to an overnight incubation at 42°C in a solution comprising 50% formamide, 5X SSC (750 mM 
NaCl, 75 mM sodium citrate), 50 mM sodium phosphate (pH 7,6), 5X Denhardfs solution, 1 0% 

25 dextran sulfate, and 20 ^g/ml denatured, sheared salmon sperm DNA, followed by washing the 
filters in 0.1 X SSC at about 65*^0. 

Also contemplated are nucleic acid molecules that hybridize to the polynucleotides of 
the present invention at lower stringency hybridization conditions. Changes in the stringency of 
30 hybridization and signal detection are primarily accomplished through the manipulation of 
formamide concentration (lower percentages of formamide result in lowered stringency); salt 
conditions, or temperature. For example, lower stringency conditions include an overnight 
incubation at 37°C in a solution comprising 6X SSPE (20X SSPE = 3M NaCl; 0.2M NaH2P04; 
0.02M EDTA, pH 7.4), 0.5% SDS, 30% formamide, 100 \xg/ml salmon sperm blocking DNA; 
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followed by washes at SO^'C with IX SSPE, 0. 1% SDS. In addition, to achieve even lower 
stringency, washes performed following stringent hybridization can be done at higher salt 
concentrations (e.g., 5X SSC). 



5 Note that variations in the above conditions may be accomplished through the inclusion 

and/or substitution of alternate blocking reagents used to suppress background in hybridization 
experiments. Typical blocking reagents include Denhardt's reagent, BLOTTO, heparin, 
denatured salmon sperm DNA, and commercially available proprietary formulations. The 
inclusion of specific blocking reagents may require modification of the hybridization conditions 
10 described above, due to problems with compatibility. 

Of course, a polynucleotide whidi hybridizes only to polyA+ sequences (such as any 3' 
terminal polyA+ tract of a cDNA shown in the sequence listing), or to a complementary stretch 
of T (or U) residues, would not be included in the definition of "polynucleotide," since such a 
15 polynucleotide would hybridize to any nucleic acid molecule containing a poly (A) stretch or 
the complement thereof (e.g., practically any double-stranded cDNA clone). 

A polynucleotide of the present invention can be composed of any polyribonucleotide or 
polydeoxribonucleotide, which may be unmodified RNA or DNA or modified RNA or DNA. 

20 For example, polynucleotides can be composed of single- and double-stranded DNA, DNA that 
is a mixture of single- and double-stranded regions, single- and double-stranded RNA, and 
RNA that is mixture of single- and double-stranded regions, hybrid molecules comprising DNA 
and RNA that may be single-stranded or, more typically, double-stranded or a mixture of 
single- and double-stranded regions. In addition, a polynucleotide can be composed of triple- 

25 stranded regions comprising RNA or DNA or both RNA and DNA. A polynucleotide may also 
contain one or more modified bases or DNA or RNA backbones modified for stability or for 
other reasons. "Modified" bases include, for example, tritylated bases and unusual bases such 
as inosine. A variety of modifications can be made to DNA and RNA; thus, "polynucleotide" 
embraces chemically, enzymatically, or metabolically modified forms. 

30 

The polypeptide of the present invention can be composed of amino acids joined to each 
other by peptide bonds or modified peptide bonds, i.e., peptide isosteres, and may contain 
amino acids other than the 20 gene-encoded amino acids. The polypeptides may be modified 
by either natural processes, such as posttranslational processing, or by chemical modification 
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techniques which are well known in the art. Such modifications are well described in basic 
texts and in more detailed monographs, as well as in a voluminous research literature. 
Modifications can occur anywhere in a polypeptide, including the peptide backbone, the amino 
acid side-chains and the amino or carboxyl termini. It will be appreciated that the same type of 
5 modification may be present in the same or varying degrees at several sites in a given 

polypeptide. Also, a given polypeptide may contain many types of modifications. Polypeptides 
may be branched, for example, as a result of ubiquitination, and they may be cyclic, with or 
without branching. Cyclic, branched, and branched cyclic polypeptides may result fi-om 
posttranslation natural processes or may be made by S3aithetic methods. Modifications include 

10 acetylation, acylation, ADP-ribosylation, amidation, covalent attachment of flavin, covalent 
attachment of a heme moiety, covalent attachment of a nucleotide or nucleotide derivative, 
covalent attachment of a lipid or lipid derivative, covalent attachment of phosphotidylinositol, 
cross-linking, cyclization, disulfide bond formation, demethylation, formation of covalent 
cross-links, formation of cysteine, formation of pyroglutamate, formulation, gamma- 

15 carboxylation, glycosylation, GPI anchor formation, hydroxy! ation, iodination, methylation, 
myristoylation, oxidation, pegylation, proteolytic processing, phosphorylation, prenylation, 
racemization, selenoylation, sulfation, transfer-RNA mediated addition of amino acids to 
proteins such as arginylation, and ubiquitination. (See, e.g., T, E. Creighton, Ed., Proteins — 
Structure And Molecular Properties^ 2nd Ed., W. H. Freeman and Company, New York (1993); 

20 B. C. Johnson, Ed., Posttranslational Covalent Modification Of Proteins^ Academic Press, New 
York, pgs. 1-12 (1983); Seifter et aL, Meth. EnzymoL 182:626-646 (1990); Rattan et aL, Ann. 
KY. Acad, ScL 663:48-62 (1992)). 

"A polypeptide having biological activity" refers to polypeptides exhibiting activity 
25 similar, but not necessarily identical to, an activity of a polypeptide of the present invention, 
including mature forms, as measured in a particular biological assay, with or without dose- 
dependency. In the case where dose dependency does exist, it need not be identical to that of 
the polypeptide, but rather substantially similar to the dose-dependence in a given activity as 
compared to the polypeptide of the present invention (i.e., the candidate polypeptide will 
30 exhibit greater activity or not more than about 25-fold less and, preferably, not more than about 
ten-fold less activity and, most preferably, not more than about three-fold less activity relative 
to the polypeptide of the present invention). 
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The translated amino acid sequence, beginning with the methionine, is identified 
although other reading frames can also be easily translated using known molecular biology 
techniques. The polypeptides produced by the translation of these alternative open reading 
fi-ames are specifically contemplated by the present invention. 

5 

SEQ ID NOs:l-23 and the translations of SEQ ID NOs: 1-23 are sufficiently accurate 
and otherwise suitable for a variety of uses well known in the art and described further below. 
These nucleic acid molecules will also hybridize to nucleic acid molecules in biological 
samples, thereby enabling a variety of forensic and diagnostic methods of the invention. 
10 Similarly, polypeptides identified fi-om the translations of SEQ ID NOs: 1-23 may be used to 
generate antibodies which bind specifically to the secreted proteins encoded by the cDNA 
clones identified. 



Nevertheless, DNA sequences generated by sequencing reactions can contain 
15 sequencing errors. The errors exist as misidentified nucleotides, or as insertions or deletions of 
nucleotides in the generated DNA sequence. The erroneously inserted or deleted nucleotides 
cause firame shifts in the reading firames of the predicted amino acid sequence. In these cases, 
the predicted amino acid sequence diverges fi-om the actual amino acid sequence, even though 
the generated DNA sequence may be greater than 99,9% identical to the actual DNA sequence 
20 (for example, one base insertion or deletion in an open reading fi'ame of over 1,000 bases). 



The present invention also relates to the genes corresponding to polynucleotides 
identified in SEQ ID NOs: 1-23, and translations of polynucleotides identified in SEQ ID 
NOs: 1-23. The corresponding gene can be isolated in accordance with known methods using 
25 the sequence information disclosed herein. Such methods include preparing probes or primers 
firom the disclosed sequence and identifying or amplifying the corresponding gene firom 
appropriate sources of genomic material. 



Also provided in the present invention are species homologues. Species homologues 
30 may be isolated and identified by making suitable probes or primers firom the sequences 
provided herein and screening a suitable nucleic acid source for the desired homologue. 

The polypeptides of the invention can be prepared in any suitable manner. Such 
polypeptides include isolated naturally occurring polypeptides, recombinantly produced 
35 polypeptides, synthetically produced polypeptides, or polypeptides produced by a combination 
of these methods. Means for preparing such polypeptides are well understood in the art. 
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The polypeptides may be in the form of the secreted protein, including the mature form, 
or may be a part of a larger protein, such as a fusion protein (see below). It is often 
advantageous to include an additional amino acid sequence which contains secretory or leader 
sequences, pro-sequences, sequences which aid in purification (such as multiple histidine 
5 residues), or an additional sequence for stability during recombinant production. 

The polypeptides of the present invention are preferably provided in an isolated form, 
and preferably are substantially purified. A recombinantly produced version of a polypeptide, 
including the secreted polypeptide, can be substantially purified by the one-step method 
10 described in Smith & Johnson, Gene^ 67:31-40 (1988), Polypeptides of the invention also can 
be purified from natural or recombinant sources using antibodies of the invention raised against 
the secreted protein in methods which are well known in the art. 

Signal Sequences 

15 Methods for predicting whether a protein has a signal sequence, as well as the cleavage 

point for that sequence, are available. For instance, the method of McGeoch uses the 
information from a short N-terminal charged region and a subsequent uncharged region of the 
complete (uncleaved) protein {Vims Res., 3:271-286 (1985)). The method of von Heinje uses 
the information from the residues surrounding the cleavage site, typically residues -13 to +2, 

20 where +1 indicates the amino terminus of the secreted protein {Nucleic Acids Res. 14:4683- 
4690 (1986)). Therefore, from a deduced amino acid sequence, a signal sequence and mature 
sequence can be identified. 

In the present case, the deduced amino acid sequence of the secreted polypeptide was 
25 analyzed by a computer program called Signal P (Nielsen et al.. Protein Engineering, 10; 1-6 
(1997), which predicts the cellular location of a protein based on the amino acid sequence. As 
part of this computational prediction of localization, the methods of McGeoch and von Heinje 
are incorporated. 

30 As one of ordinary skill would appreciate, however, cleavage sites sometimes vary from 

organism to organism and cannot be predicted with absolute certainty. Accordingly, the present 
invention provides secreted polypeptides having a sequence corresponding to the translations of 
SEQ. ID NO: 1-23 which have an N-terminus beginning within 5 residues (i.e., + or - 5 
residues) of the predicted cleavage point. Similarly, it is also recognized that in some cases, 

35 cleavage of the signal sequence from a secreted protein is not entirely uniform, resulting in 

more than one secreted species. These polypeptides, and the polynucleotides encoding such 

polypeptides, are contemplated by the present invention. 
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Moreover, the signal sequence identified by the above analysis may not necessarily 
predict the naturally occurring signal sequence. For example, the naturally occurring signal 
sequence may be further upstream from the predicted signal sequence. However, it is likely 
5 that the predicted signal sequence will be capable of directing the secreted protein to the ER. 
These polypeptides, and the polynucleotides encoding such polypeptides, are contemplated by 
the present invention. 

Polynucleotide and Polypeptide Variants 

10 "Variant" refers to a polynucleotide or polypeptide differing from the polynucleotide or 

polypeptide of the present invention, but retaining essential properties thereof In general, 
variants have close similarity overall and are identical in many regions to the polynucleotide or 
polypeptide of the present invention. 

15 "Identity" per se has an art-recognized meaning and can be calculated using published 

techniques. (See, e.g., Lesk, Ed., Computational Molecular Biology^ Oxford University Press, 
New York, (1988); Smith, Ed., Biocomputing: Informatics And Genome Projects^ Academic 
Press, New York, (1993); Griffin, and Griffin, Eds., Computer Analysis Of Sequence Data, 
Part /, Humana Press, New Jersey, ( 1 994); von Heinje, Sequence Analysis In Molecular 

20 Biology^ Academic Press, (1987); and Gribskov and Devereux, Eds., Sequence Analysis Primer ^ 
M Stockton Press, New York, (1991)). While there exists a nimiber of methods to measure 
identity between two polynucleotide or polypeptide sequences, the term "identity" is well 
known to skilled artisans (Carillo et al,, SIAM J Applied Math,, 48: 1073 (1988)). Methods 
commonly employed to determine identity or similarity between two sequences include, but are 

25 not limited to, those disclosed in "Guide to Huge Computers," Martin J. Bishop, Ed., Academic 
Press, San Diego, (1994) and Carillo et aL, (1988), Supra. Methods for aligning 
polynucleotides or polypeptides are codified in computer programs, including the GCG 
program package (Devereux et al., Nuc. Acids Res. 12:387 (1984)), BLASTP, BLASTN, 
FASTA (Atschul et aL, J, Molec. BioL 215:403 (1990)), and Bestfit program (Wisconsin 

30 Sequence Analysis Package, Version 8 for Unix, Genetics Computer Group, University 
Research Park, 575 Science Drive, Madison, WI 5371 1) which uses the local homology 
algorithm of Smith and Waterman {Adv. in App. Math., 2:482-489 (1981)). 

When using any of the sequence alignment programs to determine whether a particular 
35 sequence is, for instance, 95% identical to a reference sequence, the parameters are set such that 
the percentage of identity is calculated over the fiiU length of the reference polynucleotide and 



14 



wo 01/12647 PCT/USOO/22889 

that gaps in identity of up to 5% of the total number of nucleotides in the reference 
polynucleotide are allowed. 



A preferred method for determining the best overall match between a query sequence (a 
5 sequence of the present invention) and a subject sequence, also referred to as a global sequence 
alignment, can be determined using the FASTDB computer program based on the algorithm of 
Brutlag et aL {Comp. App. Biosci. 6:237-245 (1990)). The term "sequence" includes nucleotide 
and amino acid sequences. In a sequence alignment the query and subject sequences are either 
both nucleotide sequences or both amino acid sequences. The result of said global sequence 

10 alignment is presented in terms of percent identity. Preferred parameters used in a FASTDB 
search of a DNA sequence to calculate percent identity are: Matrix=Unitary, k-tuple=4. 
Mismatch Penalty^l, Joining Penalty=30, Randomization Group Length=0, and Cutoff Score=l, 
Gap Penalty=5, Gap Size Penalty 0.05, and Window Size=500 or query sequence length in 
nucleotide bases, whichever is shorter. Preferred parameters employed to calculate percent 

15 identity and similarity of an amino acid alignment are: Matrix=PAM 150, k-tuple=2. Mismatch 
Penal ty= 1, Joining Penalty=20, Randomization CJroup Length=0, Cutoff Score=l, Gap 
Penalty=5, Gap Size Penalty=0.05, and Window Size=500 or query sequence length in amino 
acid residues, whichever is shorter. 

20 As an illustration, a polynucleotide having a nucleotide sequence of at least 95% 

"identity" to a sequence identified in SEQ ID NOs: 1 -23 tneans that the polynucleotide is 
identical to a sequence identified in SEQ ID NOs: 1-23 or the cDNA except that the 
polynucleotide sequence may include up to five point mutations per each 1 00 nucleotides of the 
total length (not just within a given 100 nucleotide stretch). In other words, to obtain a 

25 polynucleotide having a nucleotide sequence at least 95% identical to polynucleotides identified 
in SEQ ID NOs: 1-23, up to 5% of the nucleotides in the sequence identified in SEQ ID NOs:l- 
23 or the cDNA can be deleted, inserted, or substituted with other nucleotides. These changes 
may occur an3where throughout the polynucleotide. 

30 Further embodiments of the present invention include polynucleotides having at least 

80% identity, more preferably at least 90% identity, and most preferably at least 95%, 96%, 
97%, 98% or 99% identity to a sequence identified in SEQ ID NOs: 1-23. Of course, due to the 
degeneracy of the genetic code, one of ordinary skill in the art will immediately recognize that a 
large number of the polynucleotides having at least 85%, 90%, 95%, 96%, 97%, 98%, or 99% 

35 identity will encode a polypeptide identical to an amino acid sequence contained in the 
translations of polynucleotides identified in SEQ ID NOs: 1-23. 
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Similarly, by a polypeptide having an amino acid sequence having at least, for example, 
95% "identity" to a reference polypeptide, is intended that the amino acid sequence of the 
polypeptide is identical to the reference polypeptide except that the polypeptide sequence may 
include up to five amino acid alterations per each 100 amino acids of the total length of the 
5 reference polypeptide. In other words, to obtain a polypeptide having an amino acid sequence 
at least 95% identical to a reference amino acid sequence, up to 5% of the amino acid residues 
in the reference sequence may be deleted or substituted with another amino acid, or a number of 
amino acids up to 5% of the total amino acid residues in the reference sequence may be inserted 
into the reference sequence. These alterations of the reference sequence may occur at the amino 
10 or carboxy teraiinal positions of the reference amino acid sequence or anywhere between those 
terminal positions, interspersed either individually among residues in the reference sequence or 
in one or more contiguous groups within the reference sequence. 

Further embodiments of the present invention include polypeptides having at least 80% 
15 identity, more preferably at least 85% identity, more preferably at least 90% identity, and most 
preferably at least 95%, 96%, 97%, 98% or 99% identity to an amino acid sequence contained 
in translations of polynucleotides identified in SEQ ID NOs: 1-23. Preferably, the above 
polypeptides should exhibit at least one biological activity of the protein. 

20 In a preferred embodiment, polypeptides of the present invention include polypeptides 

having at least 90% similarity, more preferably at least 95% similarity, and still more preferably 
at least 96%, 97%, 98%, or 99% similarity to an amino acid sequence contained in translations 
of polynucleotides identified in SEQ ID NOs:l-23. 

25 The variants may contain alterations in the coding regions, non-coding regions, or both. 

Especially preferred are polynucleotide variants containing alterations which produce silent 
substitutions, additions, or deletions, but do not alter the properties or activities of the encoded 
polypeptide. Nucleotide variants produced by silent substitutions due to the degeneracy of the 
genetic code are preferred. Moreover, variants in which 5-10, 1-5, or 1-2 amino acids are 

30 substituted, deleted, or added in any combination are also preferred. Polynucleotide variants 
can be produced for a variety of reasons. For instance, a polynucleotide variant may be 
produced to optimize codon expression for a particular host (i.e., codons in the human inRNA 
may be changed to those preferred by a bacterial host, such as E. coli). 

35 Naturally occurring variants are called "allelic variants," and refer to one of several 

altemate forms of a gene occupying a given locus on a chromosome of an organism (Lewin, B., 
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Ed., Genes 11^ John Wiley & Sons, New York (1985)). These allelic variants can vary at either 
the polynucleotide and/or polypeptide level. Alternatively, non-naturally occurring variants 
may be produced by mutagenesis techniques or by direct synthesis. 

5 Using known methods of protein engineering and recombinant DNA technology, 

variants may be generated to improve or alter the characteristics of the polypeptides of the 
present invention. For instance, one or more amino acids can be deleted from the N-terminus or 
C-terminus of the secreted protein without substantial loss of biological function, Ron et al. 
reported variant KGF proteins having heparin binding activity even after deleting 3, 8, or 27 
10 amino-terminal amino acid residues. (7. ^/o/. C/z^/w. 268: 2984-2988 (1993)). Similarly, 
interferon gamma exhibited up to ten times higher activity after deleting 8-10 amino acid 
residues from the carboxy terminus of this protein (Dobeli et al., J. Biotechnology, 7:199-216 
(1988)). 

1 5 Moreover, ample evidence demonstrates that variants often retain a biological activity 

similar to that of the naturally occurring protein. For example, Gayle et al. conducted extensive 
mutational analysis of human cytokine IL-la {J. Biol. Chem., 268:22105-221 1 1 (1993)). These 
investigators used random mutagenesis to generate over 3,500 individual IL-la mutants that 
averaged 2.5 amino acid changes per variant over the entire length of the molecule. Multiple 

20 mutations were examined at every possible amino acid position. The investigators concluded 
that "[m]ost of the molecule could be altered with little effect on either [binding or biological 
activity]." (See, Gayle et al. (1993), Abstract). In fact, only 23 unique amino acid sequences, 
out of more than 3,500 nucleotide sequences examined, produced a protein that differed 
significantly in activity from the wild-type sequence. 

25 

Furthermore, even if deleting one or more amino acids from the N-terminus or C- 
terminus of a polypeptide results in modification or loss of one or more biological fimctions, 
other biological activities may still be retained. For example, the ability of a deletion variant to 
induce and/or to bind antibodies which recognize the secreted form will likely be retained when 
30 less than the majority of the residues of the secreted form are removed from the N-terminus or 
C-terminus. Whether a particular polypeptide lacking N- or C-terminal residues of a protein 
retains such immunogenic activities can readily be determined by routine methods described 
herein and otherwise known in the art. 

35 Thus, the invention fiirther includes polypeptide variants which show substantial 

biological activity. Such variants include deletions, insertions, inversions, repeats, and 
substitutions selected according to general mles known in the art so as have little effect on 
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activity. For example, guidance concerning how to make phenotypically silent amino acid 
substitutions is provided in Bowie, et aL, Science 247: 1306-13 10 (1990), wherein the authors 
indicate that there are two main strategies for studying the tolerance of an amino acid sequence 
to change. 

The first strategy exploits the tolerance of amino acid substitutions by natural selection 
during the process of evolution. By comparing amino acid sequences in different species, the 
amino acid positions which have been conserved between species can be identified. These 
conserved amino acids are likely important for protein fianction. In contrast, the amino acid 
positions in which substitutions have been tolerated by natural selection indicate positions 
which are not critical for protein function. Thus, positions tolerating amino acid substitution 
may be modified while still maintaining biological activity of the protein. 

The second strategy uses genetic engineering to introduce amino acid changes at 
specific positions of a cloned gene to identify regions critical for protein function. For example, 
site-directed mutagenesis or alanine-scanning mutagenesis (the introduction of single alanine 
mutations at every residue in the molecule) can be used (Cunningham et al.. Science 244: 1 08 1 - 
1085 (1989)). The resulting mutant molecules can then be tested for biological activity. 

According to Bowie et al. (1990), these two strategies have revealed that proteins are 
surprisingly tolerant of amino acid substitutions. The authors fiirther indicate which amino acid 
changes are likely to be permissive at certain amino acid positions in the protein. For example, 
most buried or interior (within the tertiary structure of the protein) amino acid residues require 
nonpolar side chains, whereas few features of surface or exterior side chains are generally 
conserved. Moreover, tolerated conservative amino acid substitutions involve replacement of 
the aliphatic or hydrophobic amino acids Ala, Val, Leu and He; replacement of the hydroxyl 
residues Ser and Thr; replacement of the acidic residues Asp and Glu; replacement of the amide 
residues Asn and Gin; replacement of the basic residues Lys, Arg, and His; replacement of the 
aromatic residues Phe, Tyr, and Trp; and replacement of the small-sized amino acids Ala, Ser, 
Thr, Met, and Gly. 

Besides conservative amino acid substitution, variants of the present invention include: 
(i) substitutions with one or more of the non-conserved amino acid residues, where the 
substituted amino acid residues may or may not be one encoded by the genetic code; (ii) 
substitution with one or more of amino acid residues having a substituent group; (iii) fusion of 
the mature polypeptide with another compound, such as a compound to increase the stability 
and/or solubility of the polypeptide (e.g., polyethylene glycol); or (iv) fusion of the polypeptide 
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with additional amino acids, such as an IgG Fc fusion region peptide, a leader or secretory 
sequence, or a sequence facilitating purification. Such variant polypeptides are deemed to be 
within the scope of those skilled in the art fi-om the teachings herein. 

5 For example, polypeptide variants containing amino acid substitutions of charged amino 

acids with other charged or neutral amino acids may produce proteins with improved 
characteristics, such as decreased aggregation. As known, aggregation of pharmaceutical 
formulations both reduces activity and increases clearance due to the aggregate's immunogenic 
activity (see, e.g., Pinckard et al., Clin, Exp, Immunol. 2:33 1-340 (1967); Robbins et al., 
10 Diabetes 36: 838-845 (1987); Cleland et al., CriL Rev. Therap. Drug Carrier Sys. 10:307-377 
(1993)). 

Polynucleotide and Polypeptide Fragments 

In the present invention, a "polynucleotide fi-agment" refers to a short polynucleotide 
15 having a nucleic acid sequence contained in that shown in polynucleotides identified in SEQ ID 
NOs:l-23. The short nucleotide fi-agments are preferably at least about 15 nucleotides (nt), and 
more preferably at least about 20 nt, sfiU more preferably at least about 30 nt, and even more 
preferably, at least about 40 nt in length. A firagment "at least 20 nt in length," for example, is 
intended to include 20 or more contiguous bases fi-om the cDNA sequence contained in that 
20 shown in polynucleotides identified in SEQ ID NOs: 1-23. These nucleotide fi^agments are 

useful as diagnostic probes and primers £is discussed herein. Of course, larger fi-agments (e.g., 
40, 50, 150, and greater than 150 nucleotides) are preferred. 

Moreover, representative examples of polynucleotide fi-agments of the invention, 
25 include, for example, fi-agments having a sequence fi-om about nucleotide number 1-40, 1-50, 
51-100, 101-150, 151-200, 201-250, 251-300, 301-350, 351-400, 401-450, to the end of 
polynucleotides identified in SEQ ID NOs: 1-23. In this context "about" includes the 
particularly recited ranges, larger or smaller by several nucleotides (i.e., 5, 4, 3, 2, or 1 nt), at 
either terminus or at both termini. Preferably, these fragments encode a polypeptide which has 
30 biological activity. 

In the present invention, a "polypeptide Augment" refers to a short amino acid sequence 
contained in the translations of polynucleotides identified in SEQ ID NOs: 1-23. Protein 
firagments may be "firee-standing," or comprised within a larger polypeptide of which the 
35 fi*agment forms a part or region, most preferably as a single continuous region. Representative 
examples of polypeptide fragments of the invention, include, for example, fragments fi-om 
about amino acid number 1-20, 21-40, 41-60, or 61 to the end of the coding region. Moreover, 
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polypeptide fragments can be about 20, 30, 40, 50 or 60, amino acids in length. In this context 
"about" includes the particularly recited ranges, larger or smaller by several amino acids (i.e., 5, 
4, 3, 2, or 1), at either extreme or at both extremes. 

5 Preferred polypeptide fragments include the secreted protein as well as the mature form. 

Further preferred polypeptide fragments include the secreted protein or the mature form having 
a continuous series of deleted residues from the amino or the carboxy terminus, or both. For 
example, any number of amino acids ranging from 1 -60, can be deleted from the amino 
terminus of either the secreted polypeptide or the mature form. Similarly, any number of amino 
10 acids, ranging from 1-30, can be deleted from the carboxy terminus of the secreted protein or 
mature form. Furthermore, any combination of the above amino and carboxy terminus 
deletions are preferred. Similarly, polynucleotide fragments encoding these polypeptide 
fragments are also preferred. 

1 5 Also preferred are polypeptide and polynucleotide fragments characterized by structural 

or functional domains, such as fragments that comprise alpha-helix and alpha-helix forming 
regions, beta-sheet and beta-sheet-forming regions, turn and tum-forming regions, coil and coil- 
forming regions, hydrophilic regions, hydrophobic regions, alpha amphipathic regions, beta 
amphipathic regions, flexible regions, surface-forming regions, substrate binding region, and 

20 high antigenic index regions. Polypeptide fragments of the translations of polynucleotides 

identified in SEQ ID NOs:l-23 falling within conserved domains are specifically contemplated 
by the present invention. Moreover, polynucleotide fragments encoding these domains are also 
contemplated- 

25 Other preferred fragments are biologically active fragments. Biologically active 

fragments are those exhibiting activity similar, but not necessarily identical, to an activity of the 
polypeptide of the present invention. The biological activity of the fragments may include an 
improved desired activity, or a decreased undesirable activity. 

30 Epitopes & Antibodies 

In the present invention, "epitopes" refer to polypeptide fragments having antigenic or 
immunogenic activity in an animal, especially in a human. A preferred embodiment of the 
present invention relates to a polypeptide fragment comprising an epitope, as well as the 
polynucleotide encoding this fragment. A region of a protein molecule to which an antibody 
35 can bind is defined as an "antigenic epitope." In contrast, an "immunogenic epitope" is defined 
as a part of a protein that elicits an antibody response. (See, e.g. Geysen et al., Proc. Natl. Acad. 
Sci. USA, 81:3998-4002(1983)). 
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Fragments which function as epitopes may be produced by any conventional means. 
(See, e.g., Houghten, R. A., Proc. Natl. Acad, ScL USA, 82:5131-5135 (1985) fiirther described 
in U.S. Patent No. 4,631,21 1). 

5 

In the present invention, antigenic epitopes preferably contain a sequence of at least 
seven, more preferably at least nine, and most preferably between about 1 5 to about 30 amino 
acids. Antigenic epitopes are useful to raise antibodies, including monoclonal antibodies, that 
specifically bind the epitope. (See, e.g., Wilson et aL, Cell, 31:161-11^ (1984); Sutcliffe et al., 
10 Science. 219:660-666 (1983)). 



Similarly, immunogenic epitopes can be used to induce antibodies according to methods 
well known in the art. (See, e.g., Sutcliffe et aL, (1983) Supra\ Wilson et al., (1984) Supra\ 
Chow et al., Proc. Natl, Acad. ScL USA, 82:910-914; and Bittle et al., J. Gen. ViroL, 66:2347- 

15 2354(1985)). A preferred immunogenic epitope includes the secreted protein. The 

immunogenic epitope may be presented together with a carrier protein, such as an albumin, to 
an animal system (such as rabbit or mouse). Alternatively, the immunogenic epitope may be 
prescribed without a carrier if the sequence is of sufficient length (at least about 25 amino 
acids). However, immunogenic epitopes comprising as few as 8 to 10 amino acids have been 

20 shown to be sufficient to raise antibodies capable of binding to, at the very least, linear epitopes 
in a denatured polypeptide (e.g., in Westem blotting.) 

As used herein, the term "antibody" (Ab) or "monoclonal antibody" (Mab) is meant to 
include intact molecules as well as antibody fragments (such as, for example. Fab and F(ab')2 

25 fragments) which are capable of specifically binding to protein. Fab and F(ab')2 fi-agments lack 
the Fc fragment of intact antibody, clear more rapidly from the circulation, and may have less 
non-specific tissue binding than an intact antibody. (Wahl et al., J. NucL Med. 24:3 16-325 
(1983)), Thus, these fragments are preferred, as well as the products of a Fab or other 
immunoglobulin expression library. Moreover, antibodies of the present invention include 

30 chimeric, single chain, human, and humanized antibodies. 

Additional embodiments include chimeric antibodies, e.g., humanized versions of 
murine monoclonal antibodies. Such humanized antibodies may be prepared by known 
techniques, and offer the advantage of reduced immunogenicity when the antibodies are 
35 administered to humans. In one embodiment, a humanized monoclonal antibody comprises the 
variable region of a murine antibody (or just the antigen binding site thereof) and a constant 



21 



wo 01/12647 PCT/USOO/22889 
region derived from a human antibody. Altematively, a humanized antibody fragment may 
comprise the antigen binding site of a murine monoclonal antibody and a variable region 
fragment (lacking the antigen-binding site) derived from a human antibody. Procedures for the 
production of chimeric and fiirther engineered monoclonal antibodies include those described in 
5 Riechmann et al. {Nature, 332:323, 1988), Liu et al. {PNAS, 84:3439, 1987), Larrick et aL 
{Bio/Technology, 7:934, 1989), and Winter and Harris {TIPS, 14:139, May, 1993). 

One method for producing an antibody comprises immimizing a non-human animal, 
such as a transgenic mouse, with a polypeptide translated from a nucleotide sequence chosen 

10 from SEQ ID NOs:l-23, whereby antibodies directed against the polypeptide translated from a 
nucleotide sequence chosen from SEQ ID NOs:l-23 £ure generated in said animal. Procedures 
have been developed for generating human antibodies in non-human animals. The antibodies 
may be partially human, or preferably completely human. Non-human animals (such as 
transgenic mice) into which genetic material encoding one or more human immunoglobulin 

15 chains has been introduced may be employed. Such transgenic mice may be genetically altered 
in a variety of ways. The genetic manipulation may result in human immunoglobulin 
polypeptide chains replacing endogenous immunoglobulin chains in at least some (preferably 
virtually all) antibodies produced by the animal upon immunization. Antibodies produced by 
inmiunizing transgenic animals with a polypeptide translated from a nucleotide sequence 

20 chosen from SEQ ID NOs: 1-23 are provided herein. 

Mice in which one or more endogenous immunoglobulin genes are inactivated by 
various means have been prepared. Human immunoglobulin genes have been introduced into 
the mice to replace the inactivated mouse genes. Antibodies produced in the animals 
25 incorporate human immunoglobulin polypeptide chains encoded by the human genetic material 
introduced into the animal. Examples of techniques for production and use of such transgenic 
animals are described in U.S. Patent Nos. 5,814,318, 5,569,825, and 5,545,806, which are 
incorporated by reference herein. 

30 Monoclonal antibodies may be produced by conventional procedures, e.g., by 

immortalizing spleen cells harvested from the transgenic animal after completion of the 
immunization schedule. The spleen cells may be frised with myeloma cells to produce 
hybridomas, by conventional procedures. 
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A method for producing a hybridoma cell line comprises immunizing such a transgenic 
animal with a immunogen comprising at least seven contiguous amino acid residues of a 
polypeptide translated from a nucleotide sequence chosen from SEQ ID NOs: 1-23; harvesting 
spleen cells from the immunized animal; frising the harvested spleen cells to a myeloma cell 
5 line, thereby generating hybridoma cells; and identifying a hybridoma cell line that produces a 
monoclonal antibody that binds a polj^eptide translated from a nucleotide sequence chosen 
from SEQ ID NOs: 1 -23. Such hybridoma cell lines, and monoclonal antibodies produced 
therefrom, are encompassed by the present invention. Monoclonal antibodies secreted by the 
hybridoma cell line are purified by conventional techniques. 

10 

Antibodies may be employed in an in vitro procedure, or administered in vivo to inhibit 
biological activity induced by a polypeptide translated from a nucleotide sequence chosen from 
SEQ ID NOs: 1-23. Disorders caused or exacerbated (directly or indirectly) by the interaction 
of such polypeptides of the present invention with cell surface receptors thus may be treated. A 
15 therapeutic method involves in vivo administration of a blocking antibody to a mammal in an 
amount effective for reducing a biological activity induced by a polypeptide translated from a 
nucleotide sequence chosen from SEQ ID NOs: 1-23. 



Also provided herein are conjugates comprising a detectable (e.g., diagnostic) or 
20 therapeutic agent, attached to an antibody directed against a polypeptide translated from a 

nucleotide sequence chosen from SEQ ID NOs: 1-23. Examples of such agents are well known, 
and include but are not limited to diagnostic radionuclides, therapeutic radionuclides, and 
C5^otoxic drugs. The conjugates find use in in vitro or in vivo procedures. 



25 Fusion Proteins 

Any polypeptide of the present invention can be used to generate fijsion proteins. For 
example, the polypeptide of the present invention, when fiised to a second protein, can be used 
as an antigenic tag. Antibodies raised against the polypeptide of the present invention can be 
used to indirectly detect the second protein by binding to the polypeptide. Moreover, because 

30 secreted proteins target cellular locations based on trafficking signals, the polypeptides of the 
present invention can be used as targeting molecules once fiised to other proteins. 



Examples of domains that can be fiised to polypeptides of the present invention include 
not only heterologous signal sequences, but also other heterologous fiinctional regions. The 
35 fusion does not necessarily need to be direct, but may occur through linker sequences. 
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Moreover, fusion proteins may also be engineered to improve characteristics of the 
polypeptide of the present invention. For instance, a region of additional amino acids, 
particularly charged amino acids, may be added to the N-terminus of the polypeptide to 
5 improve stability and persistence during purification from the host cell or subsequent handling 
and storage. Also, peptide moieties may be added to the polypeptide to facilitate purification. 
Such regions may be removed prior to final preparation of the polypeptide. The addition of 
peptide moieties to facilitate handling of polypeptides are familiar and routine techniques in the 
art. 

10 

In addition, polypeptides of the present invention, including fragments and specifically 
epitopes, can be combined with parts of the constant domain of immunoglobulins (IgG), 
resulting in chimeric polypeptides. These fiision proteins facilitate purification and show an 
increased half-life in vivo. One reported example describes chimeric proteins consisting of the 

1 5 first two domains of the human CD4-polypeptide and various domains of the constant regions 
of the heavy or light chains of mammalian immunoglobulins. (EP A 394,827; Traunecker et aL, 
Nature^ 331 :84-86 (1988)). Fusion proteins having disulfide-linked dimeric structures (due to 
the IgG) can also be more efficient in binding and neutralizing other molecules, than the 
monomeric secreted protein or protein fragment alone. (Fountoulakis et aL, J. Biochem., 

20 270:3958-3964 (1995)). 

Similarly, EP A 0 464 533 (Canadian counterpart 2045869) discloses fiision proteins 
comprising various portions of constant region of immunoglobulin molecules together with 
another human protein or part thereof. In many cases, the Fc part in a fiision protein is 

25 beneficial in therapy and diagnosis, and thus can result in, for example, improved 

pharmacokinetic properties (see, e.g., EP A 0 232 262). Altematively, deleting the Fc part after 
the fiision protein has been expressed, detected, and purified, would be desired. For example, 
the Fc portion may hinder therapy and diagnosis if the fiision protein is used as an antigen for 
immunizations. In drug discovery, for example, human proteins, such as hIL-5, have been 

30 fiised with Fc portions for the purpose of high-throughput screening assays to identify 

antagonists of hIL-5. (See, Bennett et aL, J. Mol Recognition 8:52-58 (1995); Johanson et al., J. 
Biol, Chem., 270:9459-9471 (1995)). 

Moreover, the polypeptides of the present invention can be fiised to marker sequences, 

35 such as a peptide which facilitates purification of the fused polypeptide. In preferred 

embodiments, the marker amino acid sequence is a hexa-histidine peptide, such as the tag 

provided in a pQE vector (QIAGEN, Inc., Chatsworth, CA), among others, many of which are 
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commercially available. As described in Gentz et aL, for instance, hexa-histidine provides for 
convenient purification of the fusion protein (Proc. Natl. Acad. ScL USA 86:821-824 (1989)). 
Another peptide tag usefiil for purification, the "HA" tag, corresponds to an epitope derived 
fi-om the influenza hemagglutinin protein. (Wilson et al.. Cell 37:767 (1984)). 

5 

Thus, any of these above fusions can be engineered using the polynucleotides 
or the polypeptides of the present invention. 

Vectors, Host Cells^ and Protein Production 

10 The present invention also relates to vectors containing the polynucleotide of the present 

invention, host cells, and the production of polypeptides by recombinant techniques. The 
vector may be, for example, a phage, plasmid, viral, or retroviral vector. Retroviral vectors may 
be replication competent or replication defective. In the latter case, viral propagation generally 
will occur only in complementing host cells. 

15 

The polynucleotides may be joined to a vector containing a selectable marker for 
propagation in a host. Generally, a plasmid vector is introduced in a precipitate, such as a 
calcium phosphate precipitate, or in a complex with a charged lipid. If the vector is a virus, it 
may be packaged in vitro using an appropriate packaging cell line and then transduced into host 
20 cells. 

The polynucleotide insert should be operatively linked to an appropriate promoter, such 
as the phage lambda PL promoter, the E. coli lac, trp, phoA and tac promoters, the SV40 early 
and late promoters and promoters of retroviral LTRs, to name a few. Other suitable promoters 
25 will be known to the skilled artisan. The expression constructs will further contain sites for 
transcription initiation, termination, and, in the transcribed region, a ribosome binding site for 
translation. The coding portion of the transcripts expressed by the constructs will preferably 
include a translation initiating codon at the beginning and a termination codon (UAA, UGA or 
UAG) appropriately positioned at the end of the polypeptide to be translated. 

30 

As indicated, the expression vectors will preferably include at least one selectable 
marker. Such markers include dihydrofolate reductase, G418 or neomycin resistance for 
eukaryotic cell culture and tetracycline, kanamycin or ampicillin resistance genes for culturing 
in E. coli and other bacteria. Representative examples of appropriate hosts include, but are not 
35 limited to, bacterial cells, such as E. coli^ Streptomyces and Salmonella typhimurium cells; 

fungal cells, such as yeast cells; insect cells such as Drosophila S2 anA Spodoptera S© cells; 
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animal cells such as CHO, COS, 293, and Bowes melanoma cells, and plant cells. Appropriate 
culture mediums and conditions for the above-described host cells are known in the art. 



Among vectors preferred for use in bacteria include pQE70, pQE60 and pQE-9, 
available from QIAGEN, Inc.; pBluescript vectors, Phagescript vectors, pNH8A, PNH16a, 
PNH18A, pNH46A, available from Stratagene Cloning Systems, Inc.; and ptrc99a, pKK223-3, 
pKK233-3, pDR540, pRIT5 available from Pharmacia Biotech, Inc. Among preferred 
eukaryotic vectors are pWLNEO, pS V2CAT, pOG44, pXTl and pSG available from 
Stratagene; and pSVK3, pBPV, pMSG and pSVL available from Pharmacia. Other suitable 
vectors will be readily apparent to the skilled artisan. 

Introduction of the construct into the host cell can be effected by calcium phosphate 
transfection, DEAE-dextran mediated transfection, cationic lipid-mediated transfection, 
electroporation, transduction, infection, or other methods. Such methods are described in many 
standard laboratory manuals, such as Davis et aL, Basic Methods In Molecular Biology (1986). 
It is specifically contemplated that the polypeptides of the present invention may, in fact, be 
expressed by a host cell lacking a recombinant vector. 

A polypeptide of this invention can be recovered and purified from recombinant cell 
cultures by well-known methods including anraionium sulfate or ethanol precipitation, acid 
extraction, anion or cation exchange chromatography, phosphocellulose chromatography, 
hydrophobic interaction chromatography, affinity chromatography, hydroxylapatite 
chromatography and lectin chromatography. Most preferably, high performance liquid 
chromatography ("HPLC") is employed for purification. 

Polypeptides of the present invention, and preferably the secreted form, can also be 
recovered from products purified from natured sources, including bodily fluids, tissues and 
cells, whether directly isolated or cultured; products of chemical synthetic procedures; and 
products produced by recombinant techniques from a prokaryotic or eukaryotic host, including, 
for example, bacterial, yeast, higher plant, insect, and mammalian cells. Depending upon the 
host employed in a recombinant production procedure, the polypeptides of the present invention 
may be glycosylated or may be non-glycosylated. In addition, polypeptides of the invention 
may also include an initial modified methionine residue, in some cases as a result of host- 
mediated processes. Thus, it is well known in the art that the N-terminal methionine encoded 
by the translation initiation codon generally is removed with high efficiency from any protein 
after translation in all eukaryotic cells. While the N-terminal methionine on most proteins also 
is efficiently removed in most prokaryotes, for some proteins, this prokaryotic removal process 
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is inefficient, depending on the nature of the amino acid to which the N-terminal methionine is 
covalently Hnked. 



Uses of the Polynucleotides 

Each of the polynucleotides identified herein can be used in numerous ways as reagents. 
The following description should be considered exemplary and utilizes known techniques. 

The polynucleotides of the present invention are useful for chromosome identification. 
There exists an ongoing need to identify new chromosome markers, since few chromosome 
marking reagents based on actual sequence data (repeat polymorphisms) are presently available. 
Each polynucleotide of the present invention can be used as a chromosome marker. 

Briefly, sequences can be mapped to chromosomes by preparing PGR primers 
(preferably 15-25 bp) from the sequences shown in polynucleotides identified in SEQ ID 
NOs: 1-23. Primers can be selected using computer analysis so that primers do not span more 
than one predicted exon in the genomic DNA. These primers are then used for PGR screening 
of somatic cell hybrids containing individual human chromosomes. Only those hybrids 
containing the human gene corresponding to the polynucleotides identified in SEQ ID NOsrl- 
23 will yield an amplified fragment- 
Similarly, somatic hybrids provide a rapid method of PGR miEipping the polynucleotides 
to particular chromosomes. Three or more clones can be assigned per day using a single 
thermal cycler. Moreover, sublocalization of the polynucleotides can be achieved with panels 
of specific chromosome fragments. Other gene-mapping strategies that can be used include in 
situ hybridization, prescreening with labeled flow-sorted chromosomes, and preselection by 
hybridization to construct chromosome specific-cDNA libraries. 

Precise chromosomal location of the polynucleotides can also be achieved using 
fluorescence in situ hybridization (FISH) of a metaphase chromosomal spread. This technique 
uses polynucleotides as short as 500 or 600 bases; however, polynucleotides of 2,000-4,000 bp 
are preferred. For a review of this technique, see Verma et al.. Human Chromosomes: a 
Manual of Basic Techniques^ Pergamon Press, New York (1988). 

For chromosome mapping, the polynucleotides can be used individually (to mark a 
single chromosome or a single site on that chromosome) or in panels (for marking multiple sites 
and/or multiple chromosomes). Preferred polynucleotides correspond to the noncoding regions 
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of the cDNAs because the coding sequences are more likely conserved within gene families, 
thus increasing the chance of cross-hybridization during chromosomal mapping. 



Once a polynucleotide has been mapped to a precise chromosomal location, the physical 
5 position of the polynucleotide can be used in linkage analysis. Linkage analysis establishes 

coinheritance between a chromosomal location and presentation of a particular disease. Disease 
mapping data are found, for example, in V. McKusick, Mendelian Inheritance in Man 
(available on line through Johns Hopkins University Welch Medical Library)). Assuming one 
megabase mapping resolution and one gene per 20 kb, a cDNA precisely localized to a 
10 chromosomal region associated with the disease could be one of 50-500 potential causative 
genes. 

Thus, once coinheritance is established, differences in the polynucleotide and 
the corresponding gene between affected and unaffected individuals can be examined. The 
15 polynucleotides of polynucleotides identified in SEQ ID NOs:l-23 can be used for this analysis 
of individuals. 

First, visible structural alterations in the chromosomes, such as deletions or 
translocations, are examined in chromosome spreads or by PCR. If no structural alterations 

20 exist, the presence of point mutations are ascertained. Mutations observed in some or all 

affected individuals, but not in normal individuals, indicates that the mutation may cause the 
disease. However, complete sequencing of the polypeptide and the corresponding gene from 
several normal individuals is required to distinguish the mutation from a polymorphism. If a 
new polymorphism is identified, this polymorphic polypeptide can be used for further linkage 

25 analysis. 

Furthermore, increased or decreased expression of the gene in affected individuals as 
compared to unaffected individuals can be assessed using polynucleotides of the present 
invention. Any of these alterations (altered expression, chromosomal rearrangement, or 
30 mutation) can be used as a diagnostic or prognostic marker. 

In addition to the foregoing, a polynucleotide can be used to control gene expression 

through triple helix formation or antisense DNA or RNA. Both methods rely on binding of the 

polynucleotide to DNA or RNA. For these techniques, preferred pol3mucleotides are usually 20 

35 to 40 bases in length and complementary to either the region of the gene involved in 

transcription (see, Lee et al., Nuc. Acids Res.^ 6:3073 (1979); Cooney et aL, Science, 241 :456 

(1988); and Dervan et aL, Science, 251:1360 (1991) for discussion of triple helix formation) or 

28 



wo 01/12647 PCT/USOO/22889 

to the mRNA itself (see, Okano, /. Neurochem., 56:560 (1991); and Oligodeoxy-nucleotides as 
Antisense Inhibitors of Gene Expression^ CRC Press, Boca Raton, FL (1988) (for a discussion 
of antisense technique). Triple helix formation optimally results in a shut-off of RNA 
transcription from DNA, while antisense RNA hybridization blocks translation of an mRNA 
5 molecule into polypeptide. Both techniques are effective in model systems, and the information 
disclosed herein can be used to design antisense or triple helix polynucleotides in an effort to 
treat disease. 

Polynucleotides of the present invention are also useful in gene therapy. One goal of 
10 gene therapy is to insert a normal gene into an organism having a defective gene, in an effort to 
correct the genetic defect. The polynucleotides disclosed in the present invention offer a means 
of targeting such genetic defects in a highly accurate manner. Another goal is to insert a new 
gene that was not present in the host genome, thereby producing a new trait in the host celL 

15 The polynucleotides are also useful for identifying individuals from minute biological 

samples. The United States militaiy, for example, is considering the use of restriction fragment 
length polymorphism (RFLP) for identification of its personnel. In this technique, an 
individual's genomic DNA is digested with one or more restriction enzymes, and probed on a 
Southern blot to yield unique bands for identifying personnel. This method does not suffer 

20 from the current limitations of "Dog Tags" which can be lost, switched, or stolen, making 
positive identification difficult. The polynucleotides of the present invention can be used as 
additional DNA markers for RFLP, 

The polynucleotides of the present invention can also be used as an alternative to RFLP, 
25 by determining the actual base-by-base DNA sequence of selected portions of an individual's 
genome- These sequences can be used to prepare PCR primers for amplifying and isolating 
such selected DNA, which can then be sequenced. Using this technique, individuals can be 
identified because each individual will have a unique set of DNA sequences. Once an unique 
ID database is established for an individual, positive identification of that individual, living or 
30 dead, can be made from extremely small tissue samples. 

Forensic biology also benefits from using DNA-based identification techniques as 
disclosed herein. DNA sequences taken from very small biological samples such as tissues, 
e.g., hair or skin, or body fluids, e.g., blood, saliva, semen, etc., can be amplified using PCR. In 
35 one prior art technique, gene sequences amplified from poljmiorphic loci, such as DQa class II 
HLA gene, are used in forensic biology to identify individuals. (Erlich, Ed., PCR Technology^ 
M. Stockton Press (1989)). Once these specific polymorphic loci are amplified, they are 
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digested with one or more restriction enzymes, yielding an identifying set of bands on a 
Southern blot probed with DNA corresponding to the DQa class H HLA gene. Similarly, 
polynucleotides of the present invention can be used as polymorphic markers for forensic 
purposes. 

There is also a need for reagents capable of identifying the source of a particular tissue. 
Such need arises, for example, in forensics when presented with tissue of unknown origin. 
Appropriate reagents can comprise, for example, DNA probes or primers specific to particular 
tissue prepared from the sequences of the present invention. Panels of such reagents can 
identify tissue by species and/or by organ type. In a similar fashion, these reagents can be used 
to screen tissue cultures for contamination. 

In the very least, the polynucleotides of the present invention can be used as molecular 
weight markers on Southern gels; as diagnostic probes for the presence of a specific mRNA in a 
particular cell type; as a probe to "subtract-out*' known sequences in the process of discovering 
novel polynucleotides; for selecting and making oligomers for attachment to a "gene chip" or 
other support; to raise anti-DNA antibodies using DNA immunization techniques; and as an 
antigen to elicit an immune response. 

Uses of the Polypeptides 

Each of the polypeptides identified herein can be used in numerous ways. The 
foUov^ng description should be considered exemplary and utilizes known techniques. 

A polypeptide of the present invention can be used to assay protein levels in a biological 
sample using antibody-based techniques. For example, protein expression in tissues can be 
studied with classical inununohistological methods. (Jalkanen, et al.,*/. Cell BioLy 101:976-985 
(1985); Jalkanen, et al., J. CellBioL, 105:3087-3096 (1987)). Other antibody-based methods 
useful for detecting protein gene expression include immunoassays, such as tfie enzyme linked 
immunosorbent assay (ELISA) and the radioinmiunoassay (RIA). Suitable antibody assay 
labels are known in the art and include enzyme labels, such as, glucose oxidase, and 
radioisotopes, such as iodine ('^^I, '^'l), carbon ("^C), sulfur (^^S), tritium (-^H), indium ("*^In), 
and technetium (^^Tc), and fluorescent labels, such as fluorescein and rhodamine, and biotin. 

In addition to assaying secreted protein levels in a biological sample, proteins can also 

be detected in vivo by imaging. Antibody labels or markers for in vivo imaging of protein 

include those detectable by X-radiography, nuclear magnetic resonance (NMR) or ESR. For X- 

radiography, suitable labels include radioisotopes such as barium or cesium, which emit 
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detectable radiation but are not overtly harmful to the subject. Suitable markers for NMR and 
ESR include those with a detectable characteristic spin, such as deuterium, which may be 
incorporated into the antibody by labeling of nutrients for the relevant hybridoma, 

5 A protein-specific antibody or antibody fragment which has been labeled with an 

appropriate detectable imaging moiety, such as a radioisotope (e.g,, '-^'l, '*^In, ^^""Tc), a radio- 
opaque substance, or a material detectable by NMR, is introduced (e.g., parenterally, 
subcutaneously, or intraperitoneally) into the mammal. It will be understood in the art that the 
size of the subject and the imaging system used will determine the quantity of imaging moiety 

10 needed to produce diagnostic images. In the case of a radioisotope moiety, the quantity of 

radioactivity necessary for a human subject will normally range from about 5 to 20 millicuries 
of ^^'^Tc. The labeled antibody or antibody fragment will then preferentially accumulate at the 
location of cells which contain the specific protein. In vivo tumor imaging is described in 
Burchiel et al., "Immunopharmacokinetics of Radiolabeled Antibodies and Their Fragments" 

1 5 (Chapter 1 3 in Tumor Imaging: The Radiochemical Detection of Cancer^ Burchiel and Rhodes, 
Eds., Masson Publishing Inc. (1982).) 

Thus, the invention provides a diagnostic method of a disorder, which involves (a) 
assaying the expression of a polypeptide of the present invention in cells or body fluid of an 
20 individual; and (b) comparing the level of gene expression with a standard gene expression 
level, whereby an increase or decrease in the assayed polypeptide gene expression level 
compared to the standard expression level is indicative of a disorder. 

Moreover, polypeptides of the present invention can be used to treat disease. For 
25 example, patients can be administered a polypeptide of the present invention in an effort to 
replace absent or decreased levels of the polypeptide (e.g., insulin); to supplement absent or 
decreased levels of a different polypeptide (e.g., hemoglobin S for hemoglobin B); to inhibit the 
activity of a polypeptide (e.g., an oncogene); to activate the activity of a polypeptide (e.g., by 
binding to a receptor); to reduce the activity of a membrane bound receptor by competing with 
30 it for free ligand (e.g., soluble TNF receptors used in reducing inflammation); or to bring about 
a desired response (e.g., blood vessel growth). 

Similarly, antibodies directed to a polypeptide of the present invention can also be used 
to treat disease. For example, administration of an antibody directed to a polypeptide of the 
35 present invention can bind and reduce overproduction of the pol5T)eptide. Similarly, 

administration of an antibody can activate the polypeptide, such as by binding to a polypeptide 
bound to a membrane (receptor). 
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At the very least, the polypeptides of the present invention can be used as molecular 
weight markers on SDS-PAGE gels or on molecular sieve gel filtration columns using methods 
well-known to those of skill in the art. Polypeptides can also be used to raise antibodies, which 
5 in turn are used to measure protein expression from a recombinant cell, as a way of assessing 
transformation of the host cell. Moreover, the polypeptides of the present invention can be used 
to test the following biological activities. 

Biological Activities 

10 The polynucleotides and polypeptides of the present invention can be used in assays to 

test for one or more biological activities. If these polynucleotides and polypeptides do exhibit 
activity in a particular assay, it is likely that these molecules may be involved in the diseases 
associated with the biological activity. Thus, the polynucleotides and polypeptides could be 
used to treat the associated disease. 

15 

Nervous System Activity 

A polypeptide or polj^ucleotide of the present invention may be useful in treating 

deficiencies or disorders of the central nervous system or peripheral nervous system by 

activating or inhibiting the proliferation, differentiation, or mobilization (chemotaxis) of 

20 neuroblasts, stem cells or glial cells. Also, a polypeptide or polynucleotide of the present 

invention may be useful in treating deficiencies or disorders of the central nervous system or 
peripheral nervous system by activating or inhibiting the mechanisms of synaptic transmission, 
S3mthesis, metabolism and inactivation of neural transmitters, neuromodulators and trophic 
factors, and by activating or inhibiting the expression and incorporation of enzymes, structural 

25 proteins, membrane channels and receptors in neurons and glial cells. 

The etiology of these deficiencies or disorders may be genetic, somatic (such as cancer 
or some autoimmune disorder), acquired (e.g., by chemotherapy or toxins), or infectious. 
Moreover, a polynucleotide or polypeptide of the present invention can be used as a marker or 

30 detector of a particular nervous system disease or disorder. The disorder or disease can be any 
of Alzheimer*s disease. Pick's disease, Binswanger's disease, other senile dementia, 
Parkinson's disease, parkinsonism, obsessive compulsive disorders, epilepsy, encephaolopathy, 
ischemia, alcohol addiction, drug addiction, schizophrenia, amyotrophic lateral sclerosis, 
multiple sclerosis, depression, and bipolar manic-depressive disorder, Altematively, the 

35 polypeptide or polynucleotide of the present invention can be used to study circadian variation, 
aging, or long-term potentiation, the latter affecting the hippocampus. Additionally, 
particularly with reference to mRNA species occurring in particular structures within the central 
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nervous system, the polypeptide or polynucleotide of the present invention can be used to study 
brain regions that are known to be involved in complex behaviors, such as learning and 
memory, emotion, drug addiction, glutamate neurotoxicity, feeding behavior, olfaction, viral 
infection, vision, and movement disorders. 

5 

Immune Activity 

A polypeptide or polynucleotide of the present invention may be useful in treating 
deficiencies or disorders of the immune system, by activating or inhibiting the proliferation, 
differentiation, or mobilization (chemotaxis) of immune cells. Immune cells develop through a 

10 process called hematopoiesis, producing myeloid (platelets, red blood cells, neutrophils, and 
macrophages) and lymphoid (B and T lymphocytes) cells from pluripotent stem cells. The 
etiology of these immune deficiencies or disorders may be genetic, somatic (such as cancer or 
some autoimmune disorders), acquired (e.g., by chemotherapy or toxins), or infectious. 
Moreover, a polynucleotide or polypeptide of the present invention can be used as a marker or 

1 5 detector of a particular immune system disease or disorder. 

A polynucleotide or polypeptide of the present invention may be useful in treating or 
detecting deficiencies or disorders of hematopoietic cells. A polypeptide or polynucleotide of 
the present invention could be used to increase differentiation and proliferation of 

20 hematopoietic cells, including the pluripotent stem cells, in an effort to treat those disorders 
associated with a decrease in certain (or many) types hematopoietic cells. Examples of 
immunologic deficiency syndromes include, but are not limited to: blood protein disorders (e.g. 
agammaglobulinemia, dysgammaglobulinemia), ataxia telangiectasia, common variable 
immunodeficiency, Di George's Syndrome, HIV infection, HTLV-BLV infection, leukocyte 

25 adhesion deficiency syndrome, lymphopenia, phagocyte bactericidal dysfunction, severe 

combined immunodeficiency (SCIDs), Wiskott-Aldrich Disorder, anemia, thrombocytopenia, 
or hemoglobinuria. 

Moreover, a polypeptide or polynucleotide of the present invention could also be used to 
30 modulate hemostatic (bleeding cessation) or thrombolytic activity (clot formation). For 

example, by increasing hemostatic or thrombolytic activity, a polynucleotide or polypeptide of 
the present invention could be used to treat blood coagulation disorders (e.g., afibrinogenemia, 
factor deficiencies), blood platelet disorders (e.g. thrombocytopenia), or wounds resulting from 
trauma, surgery, or other causes. Alternatively, a polynucleotide or polypeptide of the present 
35 invention that can decrease hemostatic or thrombolytic activity could be used to inhibit or 
dissolve clotting. These molecules could be important in the treatment of heart attacks 
(infarction), strokes, or scarring. 
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A polynucleotide or polypeptide of the present invention may also be useful in the 
treatment or detection of autoimmune disorders. Many autoimmune disorders result from 
inappropriate recognition of self as foreign material by immune cells. This inappropriate 
recognition results in an immune response leading to the destruction of the host tissue. 
Therefore, the administration of a polypeptide or polynucleotide of the present invention that 
inhibits an immune response, particularly the proliferation, differentiation, or chemotaxis of T- 
cells, may be an effective therapy in preventing autoimmune disorders. 

Examples of autoimmune disorders that can be treated or detected by the present 
invention include, but are not limited to: Addison*s Disease, hemolytic anemia, 
antiphospholipid syndrome, rheumatoid arthritis, dermatitis, allergic encephalomyelitis, 
glomerulonephritis, Goodpasture's Syndrome, Graves* Disease, Multiple Sclerosis, Myasthenia 
Gravis, Neuritis, Ophthalmia, Bullous Pemphigoid, Pemphigus, Polyendocrinopathies, Purpura, 
Reiter*s Disease, Stiff-Man Syndrome, Autoimmune Thyroiditis, Systemic Lupus 
Erythematosus, Autoimmune Pulmonary Inflammation, Guillain-Barre Syndrome, insulin 
dependent diabetes mellitis, and autoimmune inflammatory eye disease. 

Similarly, allergic reactions and conditions, such as asthma (particularly allergic 
asthma) or other respiratory problems, may also be treated by a polypeptide or polynucleotide 
of the present invention. Moreover, these molecules can be used to treat anaphylaxis, 
hypersensitivity to an antigenic molecule, or blood group incompatibility. 

A polynucleotide or polypeptide of the present invention may also be used to treat 
and/or prevent organ rejection or graft-versus-host disease (GVHD). Organ rejection occurs by 
host immune cell destruction of the transplanted tissue through an immune response. Similarly, 
an immune response is also involved in GVHD, but, in this case, the foreign transplanted 
immune cells destroy the host tissues. The administration of a polypeptide or polynucleotide of 
the present invention that inhibits an immune response, particularly the proliferation, 
differentiation, or chemotaxis of T-cells, may be an effective therapy in preventing organ 
rejection or GVHD. 

Similarly, a polypeptide or polynucleotide of the present invention may also be used to 
modulate inflammation. For example, the polypeptide or polynucleotide may inhibit the 
proliferation and differentiation of cells involved in an inflammatory response. These 
molecules can be used to treat inflammatory conditions, both chronic and acute conditions, 
including inflammation associated with infection (e.g., septic shock, sepsis, or systemic 

34 



wo 01/12647 PCT/USOO/22889 

inflammatory response syndrome (SIRS)), ischemia-reperfusion injury, endotoxin lethality, 
arthritis, complement-mediated hyperacute rejection, nephritis, cytokine or chemokine induced 
lung injury, inflammatory bowel disease, Crohn's disease, or resulting from over production of 
cytokines (e.g., TNF or IL-1.) 

Hvperprolifcrative Disorders 

A polypeptide or polynucleotide can be used to treat or detect hyperproliferative 
disorders, including neoplasms. A polypeptide or polynucleotide of the present invention may 
inhibit the proliferation of the disorder through direct or indirect interactions. Alternatively, a 
polypeptide or polynucleotide of the present invention may proliferate other cells which can 
inhibit the hyperproliferative disorder. 

For example, by increasing an immune response, particularly increasing antigenic 
qualities of the hyperproliferative disorder or by inducing the proliferation, differentiation, or 
mobilization of T-cells, hyperproliferative disorders can be treated. This immune response may 
be increased by either enhancing an existing immune response, or by initiating a new immune 
response. Alternatively, decreasing an immune response may also be a method of treating 
hyperproliferative disorders, such as by administering the polypeptide or pol3mucleotide as a 
chemotherapeutic agent. 

Examples of hyperproliferative disorders that can be treated or detected by a 
polynucleotide or polypeptide of the present invention include, but are not limited to neoplasms 
located in the: abdomen, bone, breast, digestive system, liver, pancreas, peritoneum, endocrine 
glands (adrenal, parathyroid, pituitary, testicles, ovary, thymus, thyroid), eye, head and neck, 
nervous system (central and peripheral), lymphatic system, pelvic region, skin, soft tissue, 
spleen, thoracic region, and urogenital system. 

Similarly, other hyperproliferative disorders can also be treated or detected by a 
polynucleotide or polypeptide of the present invention. Examples of such hyperproliferative 
disorders include, but are not limited to hypergammaglobulinemia, Ijmiphoproliferative 
disorders, paraproteinemias, purpura, sarcoidosis, Sezary Syndrome, Waldenstron's 
Macroglobulinemia, Gaucher's Disease, histiocytosis, and any other hyperproliferative disease, 
besides neoplasia, located in an organ system listed above. 

Infectious Disease 

A polypeptide or polynucleotide of the present invention can be used to treat or detect 
infectious agents. For example, by increasing the immune response, particularly increasing the 
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proliferation and differentiation of B and/or T cells, infectious diseases may be treated. The 
immune response may be increased by either enhancing an existing immune response, or by 
initiating a new immune response. Alternatively, the polypeptide or polynucleotide of the 
present invention may also directly inhibit the infectious agent, without necessarily eliciting an 
5 immune response. 

Viruses are one example of an infectious agent that can cause disease or symptoms that 
can be treated or detected by a polynucleotide or polypeptide of the present invention. 
Examples of viruses, include, but are not limited to the following DNA and RNA viral families 

10 Arbovirus, Adenoviridae, Arenaviridae, Arteri virus, Bimaviridae, Bunyaviridae, Caliciviridae, 
Circoviridae, Coronaviridae, Flaviviridae, Hepadnaviridae (Hepatitis), Herpesviridae (such as. 
Cytomegalovirus, Herpes Simplex, Herpes Zoster), Mononegavirus (e.g., Paramyxoviridae, 
Morbillivirus, Rhabdoviridae), Orthomyxoviridae (e.g.. Influenza), Papovaviridae, 
Parvoviridae, Picomaviridae, Poxviridae (such as Smallpox or Vaccinia), Reoviridae (e.g., 

15 Rotavirus), Retroviridae (HTLV-I, HTLV-II, Lentivirus), and Togaviridae (e.g., Rubivirus). 

Viruses falling within these families can cause a variety of diseases or symptoms, including, but 
not limited to arthritis, bronchioUitis, encephalitis, eye infections (e.g., conjunctivitis, keratitis), 
chronic fatigue syndrome, hepatitis (A, B, C, E, Chronic Active, Delta), meningitis, 
opportunistic infections (e.g., AIDS), pneimionia, Burkitt's Lymphoma, chickenpox, 

20 hemorrhagic fever. Measles, Mumps, Parainfluenza, Rabies, the common cold. Polio, leukemia. 
Rubella, sexually transmitted diseases, skin diseases (e.g., Kaposi's, warts), and viremia. A 
polypeptide or polynucleotide of the present invention can be used to treat or detect any of these 
symptoms or diseases. 

25 Similarly, bacterial or fungal agents that can cause disease or symptoms and that can be 

treated or detected by a pol3mucleotide or polypeptide of the present invention include, but are 
not limited to, the following Gram-Negative and Gram-positive bacterial families and fungi: 
Actinomycetales (e.g., Corynebacterium, Mycobacterium, Norcardia), Aspergillosis, 
Bacillaceae (e.g.. Anthrax, Clostridium), Bacteroidaceae, Blastomycosis, Bordetella, Borrelia, 

30 Brucellosis, Candidiasis, Campylobacter, Coccidioidomycosis, Cryptococcosis, 
Dermatocycoses, Enterobacteriaceae (Klebsiella, Salmonella, Serratia, Yersinia), 
Erysipelothrix, Helicobacter, Legionellosis, Leptospirosis, Listeria, Mycoplasmatales, 
Neisseriaceae (e.g., Acinetobacter, Gonorrhea, Menigococcal), Pasteurellacea infections (e.g., 
Actinobacillus, Heamophilus, Pasteurella), Pseudomonas, Rickettsiaceae, Chlamydiaceae, 

35 Syphilis, and Staphylococcus. These bacterial or fungal families can cause the numerous 
diseases or symptoms, including, but not limited to, bacteremia, endocarditis, eye infections 
(conjunctivitis, tuberculosis, uveitis), gingivitis, opportunistic infections (e.g., AIDS related 
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infections), paronychia, prosthesis-related infections, Reiter's Disease, respiratory tract 
infections (such as whooping cough or empyema), sepsis, Lyme Disease, Cat-Scratch Disease, 
Dysentery, Paratyphoid Fever, food poisoning. Typhoid, pneumonia. Gonorrhea, meningitis. 
Chlamydia, Syphilis, Diphtheria, Leprosy, Paratuberculosis, Tuberculosis, Lupus, Botulism, 
5 gangrene, tetanus, impetigo, Rheumatic Fever, Scarlet Fever, sexually-transmitted diseases, 
skin diseases (e.g., cellulitis, dermatocycoses), toxemia, urinary tract infections, and wound 
infections. A polypeptide or polynucleotide of the present invention can be used to treat or 
detect any of these symptoms or diseases, 

10 Moreover, parasitic agents causing disease or symptoms that can be treated or detected 

by a polynucleotide or polypeptide of the present invention include, but are not limited to the 
following families Amebiasis, Babesiosis, Coccidiosis, Cryptosporidiosis, Dientamoebiasis, 
Dourine, Ectoparasitic, Giardiasis, Helminthiasis, Leishmaniasis, Theileriasis, Toxoplasmosis, 
Trypanosomiasis, and Trichomonas. These parasites can cause a variety of diseases or 

1 5 symptoms, including, but not limited to. Scabies, Trombiculiasis, eye infections, intestinal 
disease (e.g., dysentery, giardiasis), liver disease, lung disease, opportunistic infections (e.g., 
AIDS related). Malaria, pregnancy complications, and toxoplasmosis. A polypeptide or 
polynucleotide of the present invention can be used to treat or detect any of these symptoms or 
diseases. 

20 

Preferably, treatment using a polypeptide or polynucleotide of the present invention 
could either be by administering an effective amount of a polypeptide to the patient, or by 
removing cells from the patient, supplying the cells with a polynucleotide of the present 
invention, and retuming the engineered cells to the patient (ex vivo therapy). Moreover, the 
25 polypeptide or polynucleotide of the present invention can be used as an antigen in a vaccine to 
raise an immune response against infectious disease. 

Regeneration 

A polynucleotide or polypeptide of the present invention can be used to differentiate, 
30 proliferate, and attract cells, leading to the regeneration of tissues (see. Science 276:59-87 

(1 997)). The regeneration of tissues could be used to repair, replace, or protect tissue damaged 
by congenital defects, trauma (wounds, bums, incisions, or ulcers), age, disease (e.g. 
osteoporosis, osteocarfliritis, periodontal disease, liver failure), surgery (including cosmetic 
plastic surgery), fibrosis, reperfiision injury, or systemic cytokine damage. 

35 

Tissues that could be regenerated using the present invention include organs (e.g., 

pancreas, liver, intestine, kidney, skin, endothelium), muscle (smooth, skeletal or cardiac), 
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vascular (including vascular endothelium), nervous, hematopoietic, and skeletal (bone, 
cartilage, tendon, and ligament) tissue. Preferably, regeneration occurs without scarring or with 
minimal scarring. Regeneration also may include angiogenesis. 



5 Moreover, a polynucleotide or polypeptide of the present invention may increase 

regeneration of tissues difficult to heal. For example, increased tendon/ligament regeneration 
would quicken recovery time after damage. A polynucleotide or polypeptide of the present 
invention could also be used prophylactically in an effort to avoid damage. Specific diseases 
that could be treated include of tendinitis, carpal tunnel syndrome, and other tendon or ligament 
10 defects. A further example of tissue regeneration of non-healing wounds includes pressure 
ulcers, ulcers associated with vascular insufficiency, surgical, and traumatic wounds. 

Similarly, nerve and brain tissue could also be regenerated by using a polynucleotide or 
polypeptide of the present invention to proliferate and differentiate nerve cells. Diseases that 

15 could be treated using this method include central and peripheral nervous system diseeises, 

neuropathies, or mechanical and traumatic disorders (e.g., spinal cord disorders, head trauma, 
cerebrovascular disease, and stroke). Specifically, diseases associated with peripheral nerve 
injuries, peripheral neuropathy (e.g., resulting fi-om chemotherapy or other medical therapies), 
localized neuropathies, and central nervous system diseases (e.g., Al2dieimer*s disease, 

20 Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis, and Shy-Drager 
sjmdrome), could all be treated using the polynucleotide or polypeptide of the present 
invention. 

Chemotaxis 

25 A polynucleotide or polypeptide of the present invention may have chemotaxis activity. 

A chemotaxic molecule attracts or mobilizes cells (e.g., monocytes, fibroblasts, neutrophils, T- 
cells, mast cells, eosinophils, epithelial and/or endothelial cells) to a particular site in the body, 
such as inflammation, infection, or site of hyperproliferation. The mobilized cells can then 
fight off and/or heal the particular trauma or abnormality. 

30 

A poljmucleotide or polypeptide of the present invention may increase chemotaxic 
activity of particular cells. These chemotactic molecules can then be used to treat 
inflammation, infection, hyperproliferative disorders, or any immune system disorder by 
increasing the number of cells targeted to a particular location in the body. Forexample, 
35 chemotaxic molecules can be used to treat wounds and other trauma to tissues by attracting 

immune cells to the injured location. Chemotactic molecules of the present invention can also 
attract fibroblasts, which can be used to treat wounds. 
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It is also contemplated that a polynucleotide or polypeptide of the present invention may 
inhibit chemotactic activity. Such molecules could also be used to treat a variety of disorders. 
Thus, a polynucleotide or polypeptide of the present invention could be used as an inhibitor of 
5 chemotaxis. 

Binding Activity 

A polypeptide of the present invention may be used to screen for molecules that bind to 
the polypeptide or for molecules to which the polypeptide binds. The binding of the 
10 polypeptide and the molecule may activate (i.e., an agonist), increase, inhibit (i.e., an 

antagonist), or decrease activity of the polypeptide or the molecule bound. Examples of such 
molecules include antibodies, oligonucleotides, proteins (e.g., receptors), or small molecules. 

Preferably, the molecule is closely related to the natural ligand of the polypeptide, e.g., 
15 a fragment of the ligand, or a natural substrate, a ligand, a structural or functional mimetic (see, 
Coligan et al.. Current Protocols in Immunology 1(2), Chapter 5 (1991)). Similarly, the 
molecule can be closely related to the natural receptor to which the polypeptide binds or, at 
least, a fragment of the receptor capable of being bound by the polypeptide (e.g., an active site). 
In either case, the molecule can be rationally designed using known techniques. 

20 

Preferably, the screening for these molecules involves producing appropriate cells which 
express the polypeptide, either as a secreted protein or on the cell membrane. Preferred cells 
include cells from mammals, yeast, Drosophila, or E. colL Cells expressing the polypeptide (or 
cell membrane containing the expressed polypeptide) are then preferably contacted with a test 
25 compound potentially containing the molecule to observe binding, stimulation, or inhibition of 
activity of either the polypeptide or the molecule. 

The assay may simply test binding of a candidate compound to the polypeptide, wherein 
binding is detected by a label, or in an assay involving competition with a labeled competitor. 
30 Further, the assay may test whether the candidate compound results in a signal generated by 
binding to the polypeptide. 

Altematively, the assay can be carried out using cell-free preparations, 
polypeptide/molecule affixed to a solid support, chemical libraries, or natural product mixtures. 
35 The assay may also simply comprise the steps of mixing a candidate compound with a solution 
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containing a polypeptide, measuring polypeptide/molecule activity or binding, and comparing 
the polypeptide/molecule activity or binding to a standard. 



Preferably, an ELISA assay can measure polypeptide level or activity in a sample (e.g., 
biological sample) using a monoclonal or polyclonal antibody. The antibody can measure 
polypeptide level or activity by either binding, directly or indirectly, to the polypeptide or by 
competing with the polypeptide for a substrate. 

All of these above assays can be used as diagnostic or prognostic markers. The 
molecules discovered using these assays can be used to treat disease or to bring about a 
particular result in a patient (e.g., blood vessel growth) by activating or inhibiting the 
polypeptide/molecule. Moreover, the assays can discover agents which may inhibit or enhance 
the production of the polypeptide from suitably manipulated cells or tissues. 

Therefore, the invention includes a method of identifying compounds which bind to a 
polypeptide of the invention comprising the steps of: (a) incubating a candidate binding 
compound with a polypeptide of the invention; and (b) determining if binding has occurred. 
Moreover, the invention includes a method of identifying agonists/antagonists comprising the 
steps of: (a) incubating a candidate compound with a polypeptide of the invention, (b) assaying 
a biological activity, and (c) determining if a biological activity of the polypeptide has been 
altered. 

Other Activities 

A polypeptide or polynucleotide of the present invention may also increase or decrease 
the differentiation or proliferation of embryonic stem cells from a lineage other thein the above 
described hemopoietic cells, 

A polypeptide or polynucleotide of the present invention may also be used to modulate 
mammalian characteristics, such as body height, weight, hair color, eye color, skin, percentage 
of adipose tissue, pigmentation, size, and shape (e.g., cosmetic surgery). Similarly, a 
polypeptide or polynucleotide of the present invention may be used to modulate mammalian 
metabolism affecting catabolism, anabolism, processing, utilization, and storage of energy. 

A polypeptide or polynucleotide of the present invention may be used to change a 
mammal's mental state or physical state by influencing biorhythms, circadian rhythms, 
depression (including depressive disorders), tendency for violence, tolerance for pain, the 
response to opiates and opioids, tolerance to opiates and opioids, withdrawal from opiates and 
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opioids, reproductive capabilities (preferably by activin or inhibin-like activity), hormonal or 
endocrine levels, appetite, libido, memory, stress, or other cognitive qualities. 



A polypeptide or polynucleotide of the present invention may also be used as a food 
5 additive or preservative, such as to increase or decrease storage capabilities, fat content, lipid, 
protein, carbohydrate, vitamins, minerals, cofactors or other nutritional components. 

Other Preferred Embodiments 

1 0 Where a polynucleotide of the invention is down-regulated and exacerbates a 

pathological condition, such as ataxia telangiectasia, the expression of the polynucleotide can 
be increased or the level of the intact polypeptide product can be increased in order to treat, 
prevent, ameliorate, or modulate the pathological condition. This can be accomplished by, for 
example, administering a polynucleotide or polypeptide of the invention to the mammalian 

15 subject. 

A polynucleotide of the invention can be administered to a mammalian subject by a 
recombinant expression vector comprising the polynucleotide. A mammalian subject can be a 
human, baboon, chimpanzee, macaque, cow, horse, sheep, pig, horse, dog, cat, rabbit, guinea 
20 pig, rat or mouse. Preferably, the recombinant vector comprises a polynucleotide shown in 

SEQ ID NOs:l-23 or a polynucleotide which is at least 98% identical to a nucleic acid sequence 
shown in SEQ ID NOs:l-23. Also, preferably, the recombinant vector comprises a variant 
polynucleotide that is at least 80%, 90%, or 95% identical to a polynucleotide comprising SEQ 
IDNOs:I-23. 

25 

The administration of a polynucleotide or recombinant expression vector of the 
invention to a mammalian subject can be used to express a polynucleotide in said subject for the 
treatment of, for example, cancer or ataxia telangiectasia. Expression of a polynucleotide in 
target cells, including but not limited to neurons, tumors, and muscle cells, would effect greater 
30 production of the encoded polypeptide. 

There are available to one skilled in the art multiple viral and non-viral methods suitable 
for introduction of a nucleic acid molecule into a target cell, as described above. In addition, a 
naked polynucleotide can be administered to target cells. Polynucleotides and recombinant 
35 expression vectors of the invention can be administered as a pharmaceutical composition. Such 
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a composition comprises an effective amount of a polynucleotide or recombinant expression 
vector, and a pharmaceutically acceptable formulation agent selected for suitability with the 
mode of administration. Suitable formulation materials preferably are non-toxic to recipients at 
the concentrations employed and can modify, maintain, or preserve, for example, the pH, 
5 osmolarity, viscosity, clarity, color, isotonicity, odor, sterility, stability, rate of dissolution or 
release, adsorption, or penetration of the composition. See Remington *s Pharmaceutical 
Sciences {1%^ Ed., A.R. Gennaro, ed.. Mack Publishing Company 1990). 

The pharmaceutically active compounds (i.e., a polynucleotide or a vector) can be 
1 0 processed in accordance with conventional methods of pharmacy to produce medicinal agents 
for administration to patients, including humans and other mammals. Thus, the pharmaceutical 
composition comprising a polynucleotide or a recombinant expression vector may be made up 
in a solid form (including granules, powders or suppositories) or in a liquid form {e,g.^ 
solutions, suspensions, or emulsions). 

15 

The dosage regimen for treating a disease with a composition comprising a 
polynucleotide or expression vector is based on a variety of factors, including the type or 
severity of the cancer or AT, the age, weight, sex, medical condition of the patient, the route of 
administration, and the particular compound employed. Thus, the dosage regimen may vary 
20 widely, but can be determined routinely using standard methods, A typical dosage may range 
from about 0.1 mg/kg to about 100 mg/kg or more, depending on the factors mentioned above. 

The frequency of dosing will depend upon the pharmacokinetic parameters of the 
polynucleotide or vector in the formulation being used. Typically, a clinician will administer 

25 the composition until a dosage is reached that achieves the desired effect. The composition 
may therefore be administered as a single dose, as two or more doses (which may or may not 
contain the same amount of the desired molecule) over time, or as a continuous infusion via 
implantation device or catheter. Further refinement of the appropriate dosage is routinely made 
by those of ordinary skill in the art and is within the ambit of tasks routinely performed by 

30 them. Appropriate dosages may be ascertained through use of appropriate dose-response data. 

The cells of a manmialian subject may be transfected in vivo, ex vivo, or in vitro. 
Administration of a polynucleotide or a recombinant vector containing a polynucleotide to a 
target cell in vivo may be accomplished using any of a variety of techniques well known to 
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those skilled in the art. For example, U,S. Patent No. 5,672,344 describes an in vivo viral- 
mediated gene transfer system involving a recombinant neurotrophic HSV-1 vector. The 
above-described compositions of polynucleotides and recombinant vectors can be transfected in 
vivo by oral, buccal, parenteral, rectal, or topical administration as well as by inhalation spray. 
5 The term "parenteral" as used herein includes, subcutaneous, intravenous, intramuscular, 
intrastemal, infusion techniques or intraperitoneally. 

While the nucleic acids and/or vectors of the invention can be administered as the sole 
active pharmaceutical agent, they can also be used in combination with one or more vectors of 
10 the invention or other agents. When administered as a combination, the therapeutic agents can 
be formulated as separate compositions that are given at the same time or different times, or the 
therapeutic agents can be given as a single composition. 

Another delivery system for polynucleotides of the invention is a "non-viral" delivery 
1 5 system. Techniques that have been used or proposed for gene therapy include DNA-ligand 

complexes, adenovirus-ligand-DNA complexes, direct injection of DNA, CaP04 precipitation, 
gene gun techniques, electroporation, lipofection, and colloidal dispersion (Mulligan, R., (1993) 
Science^ 260 (51 10):926-32). Any of these methods are widely available to one skilled in the art 
and would be suitable for use in the present invention. Other suitable methods are available to 
20 one skilled in the art, and it is to be understood that the present invention may be accomplished 
using any of the available methods of transfection. Several such methodologies have been 
utilized by those skilled in the art with varying success (Mulligan, R., (1993) Science, 260 
(5110):926-32). 

25 Where a polynucleotide of the invention is up-regulated and exacerbates a pathological 

condition in a mammalian subject, such as ataxia telangiectasia, the expression of the 
polynucleotide can be blocked or reduced or the level of the intact polypeptide product can be 
reduced in order to treat, prevent, ameliorate, or modulate the pathological condition. This can 
be accomplished by, for example, the use of antisense oligonucleotides or ribozymes. 

30 Alternatively, drugs or antibodies that bind to and inactivate the polypeptide product can be 
used. 
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Antisense oligonucleotides are nucleotide sequences which are complementary to a 
specific DNA or RNA sequence. Once introduced into a cell, the complementary nucleotides 
combine with natural sequences produced by the cell to form complexes and block either 
transcription or translation. Preferably, an antisense oligonucleotide is at least 1 1 nucleotides in 
length, but can be at least 12, 15, 20, 25, 30, 35, 40, 45, or 50 or more nucleotides long. Longer 
sequences also can be used. Antisense oligonucleotide molecules can be provided in a DNA 
construct and introduced into a cell as described above to decrease the level of gene products of 
the invention in the cell. 

Antisense oligonucleotides can be deoxyribonucleotides, ribonucleotides, or a 
combination of both. Oligonucleotides can be synthesized manually or by an automated 
synthesizer, by covalently linking the 5' end of one nucleotide with the 3* end of another 
nucleotide with non-phosphodiester intemucleotide linkages such alkylphosphonates, 
phosphorothioates, phosphorodithioates, alkylphosphonothioates, alkylphosphonates, 
phosphoramidates, phosphate esters, carbamates, acetamidate, carboxymethyl esters, 
carbonates, and phosphate triesters. See Brown, (1994) Meth. MoL BioL 20:1-8; Sonveaux, 
(1994) Meth. MoL Biol, 26:1-72; Uhlmann et al., (1990) Chem, Rev,. 90:543-583. 

Modifications of gene expression can be obtained by designing antisense 
oligonucleotides which will form duplexes to the control, 5', or regulatory regions of a gene of 
the invention. Oligonucleotides derived fi*om the transcription initiation site, e,g,^ between 
positions -10 and +10 fi^om the start site, are preferred. Similarly, inhibition can be achieved 
using "triple helix" base-pairing methodology. Triple helix pairing is useful because it causes 
inhibition of the ability of the double helix to open sufficiently for the binding of polymerases, 
transcription factors, or chaperons. Therapeutic advances using triplex DNA have been 
described in the literature {e.g.^ Gee et al., in Huber & Carr, Molecular and Immunologic 
Approaches, Futura Publishing Co., Mt. Kisco, N.Y., 1994). An antisense oligonucleotide 
also can be designed to block translation of mRNA by preventing the transcript firom binding to 
ribosomes. 

Precise complementarity is not required for successful complex formation between an 
antisense oligonucleotide and the complementary sequence of a polynucleotide. Antisense 
oligonucleotides which comprise, for example, 2, 3, 4, or 5 or more stretches of contiguous 
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nucleotides which are precisely complementary to a polynucleotide, each separated by a stretch 
of contiguous nucleotides which are not complementary to adjacent nucleotides, can provide 
sufficient targeting specificity for mRNA. Preferably, each stretch of complementary 
contiguous nucleotides is at least 4, 5, 6, 7, or 8 or more nucleotides in length. Non- 
5 complementary intervening sequences are preferably 1 , 2, 3, or 4 nucleotides in length. One 
skilled in the art can easily use the calculated melting point of an antisense-sense pair to 
determine the degree of mismatching which will be tolerated between a particular antisense 
oligonucleotide and a particular polynucleotide sequence. 

10 Antisense oligonucleotides can be modified without affecting their ability to hybridize 

to a polynucleotide of the invention. These modifications can be intemal or at one or both ends 
of the antisense molecule. For example, intemucleoside phosphate linkages can be modified by 
adding cholesteryl or diamine moieties with varying numbers of carbon residues between the 
amino groups and terminal ribose. Modified bases and/or sugars, such as arabinose instead of 

15 ribose, or a 3*, 5'-substituted oligonucleotide in which the 3' hydroxyl group or the 5' phosphate 
group are substituted, also can be employed in a modified antisense oligonucleotide. These 
modified oligonucleotides can be prepared by methods well known in the art. See, e.g,^ 
Agrawal et al., (1992) Trends Biotechnol, 10:152-158; Uhlmann et al., (1990) Chem, Rev., 
90:543-584; Uhlmann et al., (1987) Tetrahedron. Lett., 215:3539-3542. 

20 

Ribozymes are RNA molecules with catalytic activity. See^ e.g., Cech, (1987) Science, 
236:1532-1539; Cech, {1990) Ann, Rev. Biochem., 59:543-568; Cech, (1992) Curr. Opin, 
Struct. Biol, 2:605-609; Couture & Stinchcomb, (1996) Trends Genet., 12:510-515. 
Ribozymes can be used to inhibit gene function by cleaving an RNA sequence, as is known in 
25 the art {e.g.^ Haseloff et al., U.S. Patent 5,641 ,673). The mechanism of ribozyme action 

involves sequence-specific hybridization of the ribozyme molecule to complementary target 
RNA, followed by endonucleolytic cleavage. Examples include engineered hanunerhead motif 
ribozyme molecules that can specifically and efficiently catalyze endonucleolytic cleavage of 
specific nucleotide sequences. 

30 

The coding sequence of a polynucleotide of the invention can be used to generate 
ribozymes which will specifically bind to mRNA transcribed from the polynucleotide. 
Methods of designing and constructing ribozymes which can cleave RNA molecules in trans in 
a highly sequence specific manner have been developed and described in the art (see Haseloff 
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et al. (1 988) Nature, 334:585-591). For example, the cleavage activity of ribozymes can be 
targeted to specific RNAs by engineering a discrete "hybridization" region into the ribozyme. 
The hybridization region contains a sequence complementary to the target RNA and thus 
specifically hybridizes with the target (see, e.g., Gerlach et al., EP 321,201). 

5 

Specific ribozyme cleavage sites within a RNA target can be identified by scanning the 
target molecule for ribozyme cleavage sites which include the following sequences: GUA, 
GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides 
corresponding to the region of the target RNA containing the cleavage site can be evaluated for 

10 secondary structural features which may render the teirget inoperable. Suitability of candidate 
RNA targets also can be evaluated by testing accessibility to hybridization with complementary 
oligonucleotides using ribonuclease protection assays. The nucleotide sequences shown in SEQ 
ID NOs:l-23 and their complements provide sources of suitable hybridization region 
sequences. Longer complementary sequences can be used to increase the affinity of the 

15 hybridization sequence for the target. The hybridizing and cleavage regions of the ribozyme 
can be integrally related such that upon hybridizing to the target RNA through the 
complementary regions, the catalytic region of the ribozyme can cleave the target. 

Ribozymes can be introduced into cells as part of a DNA construct. Mechanical 
20 methods, such as microinjection, liposome-mediated transfection, electroporation, or calcium 
phosphate precipitation, can be used to introduce a ribozyme-containing DNA construct into 
cells in which it is desired to decrease polynucleotide expression. Alternatively, if it is desired 
that the cells stably retain the DNA construct, the construct can be supplied on a plasmid and 
maintained as a separate element or integrated into the genome of the cells, as is known in the 
25 art. A ribozyme-encoding DNA construct can include transcriptional regulatory elements, such 
as a promoter element, an enhancer or UAS element, and a transcriptional terminator signal, for 
controlling transcription of ribozymes in the cells. 

As taught in Haseloff et al., U.S. Patent 5,641 ,673, ribozymes can be engineered so that 
30 ribozyme expression will occur in response to factors which induce expression of a target gene. 
Ribozymes also can be engineered to provide an additional level of regulation, so that 
destruction of mRNA occurs only when both a ribozyme and a target gene are induced in the 
cells. 
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Pathological conditions or susceptibility to pathological conditions, such as ataxia 
telangiectasia, can be diagnosed using methods of the invention. Testing for expression of a 
polynucleotide of the invention or for the presence of the polynucleotide product can correlate 
with the severity of the condition and can also indicate appropriate treatment. For example, the 
5 presence or absence of a mutation in a polynucleotide of the invention can be determined and a 
pathological condition or a susceptibility to a pathological condition is diagnosed based on the 
presence or absence of the mutation. Further, an alteration in expression of a polypeptide 
encoded by a polynucleotide of the invention can be detected, where the presence of an 
alteration in expression of the polypeptide is indicative of the pathological condition or 
10 susceptibility to the pathological condition. The alteration in expression can be an increase in 
the amount of expression or a decrease in the amount of expression. 

As an additional method of diagnosis, a first biological sample from a patient suspected 
of having a pathological condition, such as ataxia telangiectasia, is obtained along with a 

15 second sample from a suitable comparable control source. A biological sample can comprise 
saliva, blood, cerebrospinal fluid, amniotic fluid, urine, feces, or tissue, such as gastrointestinal 
tissue. A suitable control source can be obtained from one or more mammalian subjects that do 
not have the pathological condition. For example, the average concentrations and distribution 
of a polynucleotide or polypeptide of the invention can be determined from biological samples 

20 taken from a representative population of mammalian subjects, wherein the mammalian 

subjects are the same species as the subject from which the test sample was obtained. The 
amount of at least one polypeptide encoded by a polynucleotide of the invention is determined 
in the first and second sample. The amounts of the polypeptide in the first and second samples 
are compared. A patient is diagnosed as having a pathological condition if the amount of the 

25 polypeptide in the first sample is greater than or less than the amount of the polypeptide in the 
second sample. 

Other preferred embodiments of the claimed invention include an isolated nucleic acid 
molecule comprising a nucleotide sequence which is at least 80%, preferably at least 85%, more 
30 preferably at least 90%, most preferably at least 95% identical to a sequence of at least about 40 
contiguous nucleotides in the nucleotide sequence of polynucleotides identified in SEQ ID 
NOs:l-23. 



35 



Also preferred is a nucleic acid molecule wherein said sequence of contiguous 
nucleotides is included in the nucleotide sequence of polynucleotides identified in SEQ ID 
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NOs:l-23 in the range of positions beginning with the nucleotide at about the position of the 5' 
nucleotide of the clone sequence and ending with the nucleotide at about the position of the 3* 
nucleotide of the clone sequence. 

Also preferred is a nucleic acid molecule wherein said sequence of contiguous 
nucleotides is included in the nucleotide sequence of polynucleotides identified in SEQ ID 
NOs: 1-23 in the range of positions beginning with the nucleotide at about the position of the 5' 
nucleotide of the start codon and ending with the nucleotide at about the position of the 3* 
nucleotide of the clone sequence as defined for polynucleotides identified in SEQ ID NOs:l-23. 



Similarly preferred is a nucleic acid molecule wherein said sequence of contiguous 
nucleotides is included in the nucleotide sequence of polynucleotides identified in SEQ ID 
NOs: 1-23 in the range of positions beginning with the nucleotide at about the position of the 5' 
nucleotide of the first amino acid of the signal peptide and ending with the nucleotide at about 
1 5 the position of the 3' nucleotide of the clone sequence as defined for polynucleotides identified 
in SEQ ID NOs: 1-23. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide sequence 
which is at least 95% identical to a sequence of at least about 150 contiguous nucleotides in the 
20 nucleotide sequence of polynucleotides identified in SEQ ID NOs: 1-23. 

Further preferred is an isolated nucleic acid molecule comprising a nucleotide sequence 
which is at least 95% identical to a sequence of at least about 500 contiguous nucleotides in the 
nucleotide sequence of polynucleotides identified in SEQ ID NOs: 1-23. 

25 

A further preferred embodiment is a nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to the nucleotide sequence of polynucleotides 
identified in SEQ ID NOs: 1-23 beginning with the nucleotide at about the position of the 5' 
nucleotide of the first amino acid of the signal peptide and ending with the nucleotide at about 
30 the position of the 3' nucleotide of the clone sequence as defined for polynucleotides identified 
in SEQ ID NOs: 1-23. 

A fiirther preferred embodiment is an isolated nucleic acid molecule comprising a 
nucleotide sequence which is at least 95% identical to the complete nucleotide sequence of 
35 polynucleotides identified in SEQ ID NOs: 1 -23. 
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Also preferred is an isolated nucleic acid molecule which hybridizes under stringent 
hybridization conditions to a nucleic acid molecule, wherein said nucleic acid molecule which 
hybridizes does not hybridize under stringent hybridization conditions to a nucleic acid 
molecule having a nucleotide sequence consisting of only A residues or of only T residues. 

5 

A further preferred embodiment is a method for detecting in a biological sample a 
nucleic acid molecule comprising a nucleotide sequence which is at least 95% identical to a 
sequence of at least 40 contiguous nucleotides in a sequence selected from the group consisting 
of: a nucleotide sequence of polynucleotides identified in SEQ ID NOs: 1-23, which method 
10 comprises a step of comparing a nucleotide sequence of at least one nucleic acid molecule in 

said sample with a sequence selected from said group and determining whether the sequence of 
said nucleic acid molecule in said sample is at least 95% identical to said selected sequence. 

Also preferred is the above method wherein said step of comparing sequences comprises 
15 determining the extent of nucleic acid hybridization between nucleic acid molecules in said 
sample and a nucleic acid molecule comprising said sequence selected from said group. 
Similarly, also preferred is the above method wherein said step of comparing sequences is 
performed by comparing the nucleotide sequence determined from a nucleic acid molecule in 
said sample with said sequence selected from said group. The nucleic acid molecules can 
20 comprise DNA molecules or RNA molecules. 

A further preferred embodiment is a method for identifying the species, tissue or cell 
type of a biological sample which method comprises a step of detecting nucleic acid molecules 
in said sample, if any, comprising a nucleotide sequence that is at least 95% identical to a 
25 sequence of at least 40 contiguous nucleotides in a sequence selected from the group consisting 
of: a nucleotide sequence of polynucleotides identified in SEQ ID NOs: 1-23. 

Also preferred is a method for diagnosing in a subject a pathological condition 
associated with abnormal structure or expression of a gene, which method comprises a step of 
30 detecting in a biological sample obtained from said subject nucleic acid molecules, if any, 
comprising a nucleotide sequence that is at least 95% identical to a sequence of at least 40 
contiguous nucleotides in a sequence selected from the group consisting of a nucleotide 
sequence of polynucleotides identified in SEQ ID NOs: 1-23. 

35 The method for diagnosing a pathological condition can comprise a step of detecting 

nucleic acid molecules comprising a nucleotide sequence in a panel of at least two nucleotide 
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sequences, wherein at least one sequence in said panel is at least 95% identical to a sequence of 
at least 40 contiguous nucleotides in a sequence selected from said group. 



Also preferred is a composition of matter comprising isolated nucleic acid molecules 
5 wherein the nucleotide sequences of said nucleic acid molecules comprise a panel of at least 
two nucleotide sequences, wherein at least one sequence in said panel is at least 95% identical 
to a sequence of at least 40 contiguous nucleotides in a sequence selected from the group 
consisting of: a nucleotide sequence of polynucleotides identified in SEQ ID NOs:l-23. The 
nucleic acid molecules can comprise DNA molecules or RNA molecules. 

10 

Also preferred is an isolated polypeptide comprising an amino acid sequence at least 
90% identical to a sequence of at least about 10 contiguous amino acids in an amino acid 
sequence translated from polynucleotides identified in SEQ ID NOs:I-23. 

15 Also preferred is a polypeptide, wherein said sequence of contiguous amino acids is 

included in amino acids in an amino acid sequence translated from polynucleotides identified in 
SEQ ID NOs:l-23, in the range of positions beginning with the residue at about the position of 
the first amino acid of the secreted portion and ending with the residue at about the last amino 
acid of the open reading frame. 

20 

Also preferred is an isolated polypeptide comprising an amino acid sequence at least 
95% identical to a sequence of at least about 30 contiguous amino acids in an amino acid 
sequence translated from polynucleotides identified in SEQ ID NOs:l-23. 

25 Further preferred is an isolated polypeptide comprising an amino acid sequence at least 

95% identical to a sequence of at least about 100 contiguous amino acids in an amino acid 
sequence translated from polynucleotides identified in SEQ ID NOs:l-23. 

Further preferred is an isolated polypeptide comprising an amino acid sequence at least 
30 95% identical to amino acids in an amino acid sequence translated from polynucleotides 
identified in SEQ ID NOs: 1-23. 

Further preferred is a method for detecting in a biological sample a polypeptide 
comprising an amino acid sequence which is at least 90% identical to a sequence of at least 1 0 
35 contiguous amino acids in a sequence selected from the group consisting of amino acid 

sequences translated from polynucleotides identified in SEQ ID NOs: 1-23, which method 
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comprises a step of comparing an amino acid sequence of at least one polypeptide molecule in 
said sample with a sequence selected from said group and detemiining whether the sequence of 
said polypeptide molecule in said sample is at least 90% identical to said sequence of at least 10 
contiguous amino acids. 

5 

Also preferred is the above method wherein said step of comparing an amino acid 
sequence of at least one polypeptide molecule in said sample with a sequence selected from said 
group comprises determining the extent of specific binding of polypeptides in said sample to an 
antibody which binds specifically to a polypeptide comprising an amino acid sequence that is at 
10 least 90% identical to a sequence of at least 10 contiguous amino acids in a sequence selected 
from the group consisting of amino acid sequences translated from polynucleotides identified in 
SEQIDNOs:l-23. 

Also preferred is the above method wherein said step of comparing sequences is 
15 performed by comparing the amino acid sequence determined from a polypeptide molecule in 
said sample with said sequence selected from said group. 

Also preferred is a method for identifying the species, tissue or cell type of a biological 
sample which method comprises a step of detecting polypeptide molecules in said sample, if 
20 any, comprising an amino acid sequence that is at least 90% identical to a sequence of at least 
10 contiguous amino acids in a sequence selected from the group consisting of amino acid 
sequences translated from polynucleotides identified in SEQ ID NOs:l-23. 

Also preferred is the above method for identifying the species, tissue or cell type of a 
25 biological sample, which method comprises a step of detecting polypeptide molecules 

comprising an amino acid sequence in a panel of at least two amino acid sequences, wherein at 
least one sequence in said panel is at least 90% identical to a sequence of at least 10 contiguous 
amino acids in a sequence selected from the above group. 

30 Also preferred is a method for diagnosing in a subject a pathological condition 

associated with abnormal structure or expression of a gene, which method comprises a step of 
detecting in a biological sample obtained from said subject polypeptide molecules comprising 
an amino acid sequence in a panel of at least two amino acid sequences, wherein at least one 
sequence in said panel is at least 90% identical to a sequence of at least 10 contiguous amino 

35 acids in a sequence selected from the group consisting of amino acid sequences translated from 
polynucleotides identified in SEQ ID NOs:l-23. 
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In any of these methods, the step of detecting said polypeptide molecules includes using 
an antibody. 



Also preferred is an isolated nucleic acid molecule comprising a nucleotide sequence 
5 which is at least 95% identical to a nucleotide sequence encoding a polypeptide wherein said 
polypeptide comprises an amino acid sequence that is at least 90% identical to a sequence of at 
least 10 contiguous amino acids in a sequence selected from the group consisting of amino acid 
sequences translated from polynucleotides identified in SEQ ID NOs:l-23. 

10 Also preferred is an isolated nucleic acid molecule, wherein said nucleotide sequence 

encoding a polypeptide has been optimized for expression of said polypeptide in a prokaryotic 
host- 
Also preferred is an isolated nucleic acid molecule, wherein said nucleotide sequence 

15 encodes a polypeptide comprising an amino acid sequence selected from the group consisting of 
amino acid sequences translated from polynucleotides identified in SEQ ID NOs:l-23. 

Further preferred is a method of making a recombinant vector comprising inserting any 
of the above isolated nucleic acid molecule into a vector. Also preferred is the recombinant 
20 vector produced by this method. Also preferred is a method of making a recombinant host cell 
comprising introducing the vector into a host cell, as well as the recombinant host cell produced 
by this method. 

Also preferred is a method of making an isolated polypeptide comprising culturing this 
25 recombinant host cell under conditions such that said polypeptide is expressed and recovering 
said polypeptide. Also preferred is this method of making an isolated polj^eptide, wherein said 
recombinant host cell is a eukaryotic cell and said polypeptide is a secreted portion of a human 
secreted protein comprising an amino acid sequence selected from the group consisting of 
amino acid sequences translated from polynucleotides identified in SEQ ID NOs:l-23. The 
30 isolated polypeptide produced by this method is also preferred. 

Also preferred is a method of treatment of an individual in need of an increased level of 
a secreted protein activity, which method comprises administering to such an individual a 
pharmaceutical composition comprising an amount of an isolated polypeptide, polynucleotide, 
35 or antibody of the claimed invention effective to increase the level of said protein activity in 
said individual. 
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The present invention also includes a diagnostic system, preferably in kit form, for 
assaying for the presence of the polypeptide of the present invention in a body sample, such as 
brain tissue, cell suspensions or tissue sections, or body fluid samples such as CSF, blood, 
plasma or serum, where it is desirable to detect the presence, and preferably the amount, of the 
5 polypeptide of this invention in the sample according to the diagnostic methods described 
herein. 

In a related embodiment, a nucleic acid molecule can be used as a probe (i.e., an 
oligonucleotide) to detect the presence of a polynucleotide of the present invention, a gene 

10 corresponding to a polynucleotide of the present invention, or a mRNA in a cell that is 

diagnostic for the presence or expression of a polypeptide of the present invention in the cell. 
The nucleic acid molecule probes can be of a variety of lengths from at least about 1 0 
contiguous bases, suitably about 10 to about 5000 nucleotides long, although they will typically 
be about 20 to 500 nucleotides in length. Hybridization methods are extremely well known in 

1 5 the art and will not be described further here. 

In a related embodiment, detection of genes corresponding to the polynucleotides of the 
present invention can be conducted by primer extension reactions such as the polymerase chain 
reaction (PGR). To that end, PGR primers are utilized in pairs, as is well known, based on the 
20 nucleotide sequence of the gene to be detected. Preferably, the nucleotide sequence is a portion 
of the nucleotide sequence of a polynucleotide of the present invention. Particularly preferred 
PGR primers can be derived from any portion of a DNA sequence encoding a polypeptide of 
the present invention, but are preferentially from regions which are not conserved in other 
cellular proteins. 

25 

Preferred PGR primer pairs useful for detecting the genes corresponding to the 
polynucleotides of the present invention and expression of these genes are described in the 
Examples, including the corresponding Tables. Nucleotide primers from the corresponding 
region of the polypeptides of the present invention described herein are readily prepared and 
30 used as PGR primers for detection of the presence or expression of the corresponding gene in 
any of a variety of tissues. 

The diagnostic system includes, in an amount sufficient to perform at least one assay, a 
subject polypeptide of the present invention, a subject antibody or monoclonal antibody, and/or 
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a subject nucleic acid molecule probe of the present invention, as a separately packaged 
reagent. 



In another embodiment, a diagnostic system, preferably in kit form, is contemplated for 
5 assaying for the presence of the polypeptide of the present invention or an antibody 

immunoreactive with the polypeptide of the present invention in a body fluid sample. Such 
diagnostic kit would be useful for monitoring the fate of therapeutically administered 
polypeptide of the present invention or an antibody inmiunoreactive with the polypeptide of the 
present invention. The system includes, in an amount sufficient for at least one assay, a 
10 polypeptide of the present invention and/or a subject antibody as a separately packaged 
immunochemical reagent. 

Instmctions for use of the packaged reagent(s) are also typically included. 

15 As used herein, the term "package" refers to a solid matrix or material such as glass, 

plastic (e.g., polyethylene, polypropylene or polycarbonate), paper, foil and the like capable of 
holding within fixed limits a polypeptide, polyclonal antibody or monoclonal antibody of the 
present invention. Thus, for example, a package can be a glass vial used to contain milligram 
quantities of a contemplated polypeptide or antibody or it can be a microtiter plate well to 

20 which microgram quantities of a contemplated polypeptide or antibody have been operatively 
affixed, (i.e., linked) so as to be capable of being immunologically bound by an antibody or 
antigen, respectively. 

"Instructions for use" typically include a tangible expression describing the reagent 
25 concentration or at least one assay method parameter such as the relative amounts of reagent 
and sample to be admixed, maintenance time periods for reagent/ sample admixtures, 
temperature, buffer conditions and the like. 

A diagnostic system of the present invention preferably also includes a label or 
30 indicating means capable of signaling the formation of an immunocomplex containing a 
polypeptide or antibody molecule of the present invention. 



54 



wo 01/12647 PCT/USOO/22889 
The word "complex" as used herein refers to the product of a specific binding reaction 
such as an antibody-antigen or receptor-ligand reaction. Exemplary complexes are 
inamunoreaction products. 



5 As used herein, the terms "label" and "indicating means" in their various grammatical 

forms refer to single atoms and molecules that are either directly or indirectly involved in the 
production of a detectable signal to indicate the presence of a complex. Any label or indicating 
means can be linked to or incorporated in an expressed protein, polypeptide, or antibody 
molecule that is part of an antibody or monoclonal antibody composition of the present 
10 invention, or used separately, and those atoms or molecules can be used alone or in conjunction 
with additional reagents. Such labels are themselves well-known in clinical diagnostic 
chemistry and constitute a part of this invention only insofar as they are utilized with otherwise 
novel proteins methods and/or systems. 



15 The labeling means can be a fluorescent labeling agent that chemically binds to 

antibodies or antigens without denaturing them to form a fluorochrome (dye) that is a useful 
immunofluorescent tracer. Suitable fluorescent labeling agents are fluorochromes such as 
fluorescein isocyanate (FlC), fluorescein isothiocyante (FITC), 5-dimethylamine-l- 
naphthalenesulfonyl chloride (DANSC), tetramethylrhodamine isothiocyanate (TRITC), 

20 lissamine, rhodamine 8200 sulphonyl chloride (RB 200 SC) and the like. A description of 

immunofluorescence analysis techniques is found in DeLuca, "Immunofluorescence Analysis", 
in Antibody As a Tool, Marchalonis, et al., Eds., John Wiley & Sons, Ltd., pp. 189-231 (1982), 
which is incorporated herein by reference. Other suitable labeling agents are known to those 
skilled in the art. 

25 

In preferred embodiments, the indicating group is an enzyme, such as horseradish 
peroxidase (HRP), glucose oxidase, or the like. In such cases where the principal indicating 
group is an enzyme such as HRP or glucose oxidase, additional reagents are required to 
visualize the fact that a receptor-ligand complex (immunoreactant) has formed. Such additional 
30 reagents for HRP include hydrogen peroxide and an oxidation dye precursor such as 

diaminobenzidine. An additional reagent useful with glucose oxidase is 2,2'-amino-di-(3-ethyl- 
benzthiazoline-G-sulfonic acid) (ABTS). 
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Radioactive elements are also useful labeling agents and are used illustratively herein. 
An exemplary radiolabeling agent is a radioactive element that produces gamma ray emissions. 
Elements which themselves emit gamma rays, such as '^I, '^^I and ^'Cr represent one 
class of gamma ray emission-producing radioactive element indicating groups. Particularly 
5 preferred is ^^^I. Another group of useful labeling means are those elements such as ' 'C, '^F, 
^^O and '^N which themselves emit positrons. The positrons so emitted produce gamma rays 
upon encounters with electrons present in the animal's body. Also useful is a beta emitter, such 
indium or^H. 

10 The linking of labels, and the labeling of polypeptides and proteins is well known in the 

art. For instance, antibody molecules produced by a hybridoma can be labeled by metabolic 
incorporation of radioisotope-containing amino acids provided as a component in the culture 
medium. See, e.g., Galfre et al., Meth, EnzymoL^ 73:3-46 (1981). The techniques of protein 
conjugation or coupling through activated functional groups are particularly applicable. See, 

15 e.g., Aurameas, et al., Scand. J. ImmunoL, Vol. 8 Suppl. 7:7-23 (1978), Rodwell et al., Biotech,, 
3:889-894 (1984), and U.S. Pat. No. 4,493,795. 

The diagnostic systems can also include, preferably as a separate package, a specific 
binding agent. A "specific binding agent" is a molecular entity capable of selectively binding a 
20 reagent species of the present invention or a complex containing such a species, but is not itself 
a polypeptide or antibody molecule composition of the present invention. Exemplary specific 
binding agents are second antibody molecules, complement proteins or fragments thereof, 5. 
aureus protein A, and the like. Preferably the specific binding agent binds the reagent species 
when that species is present as part of a complex. 

25 

In preferred embodiments, the specific binding agent is labeled. However, when the 
diagnostic system includes a specific binding agent that is not labeled, the agent is typically 
used as an amplifying means or reagent. In these embodiments, the labeled specific binding 
agent is capable of specifically binding the amplifying means when the amplifying means is 
30 bound to a reagent species-containing complex. 

The diagnostic kits of the present invention can be used in an "ELISA" format to detect 
the quantity of the polypeptide of the present invention in a sample. "ELISA" refers to an 
enzyme-linked immunosorbent assay that employs an antibody or antigen bound to a solid 
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phase and an enzyme-antigen or enzyme-antibody conjugate to detect and quantify the amount 
of an antigen present in a sample. A description of the ELISA technique is found in Sites et al.. 
Basic and Clinical Immunology^ 4th Ed. Chap. 22, Lange Medical Publications, Los Altos, CA 
(1982) and in U.S. Patent No. 3,654,090; U.S. Patent No. 3,850,752; and U.S. Patent No. 
5 4,016,043, which are all incorporated herein by reference. 

Thus, in some embodiments, a polypeptide of the present invention, an antibody or a 
monoclonal antibody of the present invention can be affixed to a solid matrix to form a solid 
support that comprises a package in the subject diagnostic systems. 

10 

A reagent is typically affixed to a solid matrix by adsorption fi-om an aqueous medium, 
although other modes of affixation applicable to proteins and polypeptides can be used that are 
well known to those skilled in the art. Exemplary adsorption methods are described herein. 

15 Useful solid matrices are also well known in the art. Such materials are water insoluble 

and include the cross-linked dextran available under the trademark SEPHADEX from 
Pharmacia Fine Chemicals (Piscataway, NJ), agarose, polystyrene beads of about 1 micron 
((xm) to about 5 millimeters (mm) in diameter available from several suppliers, (e.g., Abbott 
Laboratories, Chicago, IL), polyvinyl chloride, polystyrene, cross-linked polyacrylamide, 

20 nitrocellulose- or nylon-based webs (sheets, strips or paddles) or tubes, plates or the wells of a 
microliter plate, such as those made from polystyrene or polyvinylchloride. 

The reagent species, labeled specific binding agent, or amplifying reagent of any 
diagnostic system described herein can be provided in solution, as a liquid dispersion or as a 
25 substantially dry power, e.g., in lyophilized form. Where the indicating means is an enzyme, 
the enzyme's substrate can also be provided in a separate package of a system. A solid support 
such as the before-described microtiter plate and one or more buffers can also be included as 
separately packaged elements in this diagnostic assay system. 

30 The packaging materials discussed herein in relation to diagnostic systems are those 

customarily utilized in diagnostic systems. 

Having generally described the invention, the same will be more readily understood by 
reference to the following examples, which are provided by way of illustration and are not 
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EXAMPLE 1 

Identification and Characterization of Regulated Polynucleotides 

5 

Spectral Karyotyping (SKY) and Fluorescent In Situ Hybridization (FISH) oiAtm deficient T- 
cell lymphoblastic lymphomas 

Ten different Atm deficient tumor cell lines were established. The Atm deficient tumor 
10 cell lines have been analyzed with a variety of methods and shown to maintain their clonality in 
culture even with continuous passage. These cell lines were used as a source of mRNA and 
were characterized fiirther. The genetic alterations in these cell lines were characterized using 
spectral karyotyping (SKY) and fluorescent in situ hybridization (FISH). Consistent 
abnormalities of chromosome 14 and chromosome 12, similar to those found in human 
15 leukemias and lymphomas, were observed in all tumors studied. Examples of such 

abnormalities are shown in Table L The observed abnormalities included insertions (ins6:14) 
as well as translocations (tl2;10, tl4;15 and tl4;3). 

Abnormalities at the TCR locus as foimd in human hematopoietic cancer were observed. 
Genomic clones in bacterial artificial chromosomes (BACs) containing the candidate gene of 

20 interest were selected. These BACs were fluorescently labeled and used as probes on metaphase 
preparations using FISH. The results of these experiments demonstrated that both alleles of the 
TCRa locus are abnormally rearranged in all tumor cell lines studied. These results fiirther 
demonstrated that the TCR locus is rearranged in both human and mouse lymphomas. In 
addition, two of ten tumors analyzed showed chromosomal abnormalities of genes known to be 

25 involved in human tumors. 

ATM fiinction is therefore critical for maintaining appropriate recombination pathways 
and in the absence of ATM, both humans and mice develop aggressive tumors. A similar 
pathway of tumorigenesis is also likely to occur in the absence of ATM in both human and 
mouse. Further studies have involved the transformation of the normal T cell to a malignant cell 
30 as described below. 

Cytoplasmic RNA was prepared fi-om cells fi-om four tumor cell lines: AT-4, AT-7, AT- 
12 and AT- 13 that had the chromosomal abnormalities summarized in Table 1. 
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Table 1 

T-Cell Lymphomas From Attn -A Mice Show Consistent Abnormalities of Chromosomes 12 & 14 


Tumor 
Ceil Line 


No. of 
Chromosomal 
Abnormalities 


Chromosomal Abnormalities 


Gains/ 
Losses 


AT-1 


4 


T(12;14); Del (12); T(14;15); Dup(15); T(14;3); Dup(3); lns{6;14) 


2/1 


AT-2 


3 


T(12;15): T(15;12); T(4;3); Del(4); Dup(3) 


1/1 


AT-3 


4 


T(12;8); Del (12); T(13;10); Dup(10); lns(2;6); Dup(6); T(8;13) 


2/1 


AT-4 


4 


T(12;14); Del (12); T(14;15); Dup(15); T(14;X); Del(14); T(X;11); 
lns(6:14) 


2/2 


AT-5 


3 


T(12;14); Del (12); T(14;15); Dup(15); T(14;X); Del(14); Dup(X) 


2/2 


AT-7 


4 


T(12;14); Del (12); T(14;16); lns(14;1); Del(14); Del(14) +15 


1/3 


AT-10 


6 


T(12;10); Del (12); Dup(14); T(X;15); Dup(15); Del (X); T(17;1); 
Dup(1); Del(16) -11 


3/4 


AT-11 


5 


T(12;6); Del (12); Dup(6); Die (14;14); Dup(14); Die (15;15); 
Dup(15); Rb(16;16); Dup(16) +15 


5/1 


AT-12 


6 


T(12:9); Del (12); T(9;15); Dup(15); Del(14) +10 +13 +15 


4/2 


AT-13 


3 


T(12;14); Del (12); T(14;15); lns(14;15); Dup(15); Del(14) 


1/2 


(T-translocation, Dup-duplication, Del-deletion, Ins-insertion) 



Isolated RNA was analyzed using a method of simultaneous sequence-specific identification of 
mRNAs known as TOGA (TOtal Gene expression Analysis) described in Sutclifife, LG. et ah, 
Proc. Natl. Acad. ScL USA, 97(5): 1976- 1981 (2000); International published application 
5 PCT/US99/23655, U.S. Patent No. 5,459,037, U.S. Patent No. 5,807,680, U.S. Patent No. 

6,030,784, and U.S. Patent No. 6,096,503, hereby incorporated herein by reference. Preferably, 
prior to the application of the TOGA technique, the isolated RNA is enriched to form a starting 
polyA-containing mRNA population by methods known in the art. In such a preferred 
embodiment, the TOGA method further comprises an additional Polymerase Chain Reaction 

10 ("PGR") step performed using four 5' PGR primers in four separate reactions and cDNA 

templates prepared fi-om a population of antisense cRN As. A final PGR step that used 256 5' 
PGR primers in separate reactions produced PGR products that were cDNA jfragments that 
corresponded to a 3'-region of the starting mRNA population. The produced PGR products 
were then identified by a) the initial 5' sequence comprising the sequence remainder of the 

15 recognition site of the restriction endonuclease used to cut and isolate the 3' region plus the 
sequence of the preferably four parsing bases immediately 3 ' to the remainder of the 
recognition site, preferably the sequence of the entire fragment, and b) the length of the 
fragment. These two parameters, sequence and fi-agment length, were used to compare the 
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obtained PGR products to a database of known polynucleotide sequences. Since the length of 
the obtained PGR products includes known vector sequences at the 5' and 3' ends of the insert, 
the sequence of the insert provided in the sequence listing is shorter than the fragment length 
that forms part of the digital address. 

5 

The method yields Digital Sequence Tags (DSTs), that is, polynucleotides that are 
expressed sequence tags (ESTs) of the 3' end of mRNAs. DSTs that showed changes in relative 
levels when comparing 1 of 4 samples, or showed paired differences in 2 of 4 samples or were 
highly expressed and located on mouse chromosomes involved in the translocations were 
10 selected for further study. The intensities of the laser-induced fluorescence of the labeled PGR 
products were compared across samples isolated from the four tumor cell lines AT-4, AT-7, 
AT- 12 and AT- 13. The results are presented in Tables 2 and 3. 

In general, double-stranded cDNA is generated from poly(A)-enriched cytoplasmic 
15 RNA extracted from the tissue samples of interest using an equimolar mixture of all 48 5'- 

biotinylated anchor primers of a set to initiate reverse transcription. One such suitable set is G- 
A-A-T-T-C-A-A-C-T-G-G-A-A-G-C-G-G-C-G-C-G-G-A-G-G-A-A-T-T-T-T-T-T-T-T-T-T-T- 
T^T-T-T-T-T-T-V-N-N (SEQ ID NO:24), where V is A, C or G and N is A, C, G or T. One 
member of thiis mixture of 48 anchor primers initiates synthesis at a fixed position at the 3' end 
20 of all copies of each mRNA species in the sample, thereby defining a 3* endpoint for each 
species, resulting in biotinylated double stranded cDNA. 

Each biotinylated double stranded cDNA sample was cleaved with the restriction 
endonuclease MspL which recognizes the sequence CCGG. The 3' fragments of cDNA were 
25 then isolated by capture of the biotinylated cDNA fragments on a streptavidin-coated substrate. 
Suitable streptavidin-coated substrates include microtitre plates, PGR tubes, polystyrene beads, 
paramagnetic polymer beads and paramagnetic porous glass particles. A preferred streptavidin- 
coated substrate is a suspension of paramagnetic polymer beads (Dynal, Inc., Lake Success, 
NY). 

30 

After washing the streptavidin-coated substrate and captured biotinylated cDNA 
fragments, the cDNA fragment product was released by digestion with NotI, which cleaves at 
an 8-nucleotide sequence within the anchor primers but rarely within the mRNA-derived 
portion of the cDNAs. The 3' Ms p I- Not I fragments, which are of uniform length for each 
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mRNA species, were directionally ligated into Clal-, Notl-cleaved plasmid pBC SK"*" 
(Stratagene, La JoUa, CA) in an antisense orientation with respect to the vector's T3 promoter, 
and the product used to transform Escherichia coli SURE cells (Stratagene). The ligation 
regenerates the NotI site, but not the Mspl site leaving CGG as the first three bases of the 5' end 
5 of all PGR products obtained. Each library contained in excess of 5 x 10^ recombinants to 

ensure a high likelihood that the 3* ends of all mRNAs with concentrations of 0,001% or greater 
were multiply represented. Plasmid preps (Qiagen) were made from the cDNA library of each 
sample under study. 

10 An aliquot of each library was digested with Mspl. which effects linearization by 

cleavage at several sites within the parent vector while leaving the 3* cDNA inserts and their 
flanking sequences, including the T3 promoter, intact The product was incubated with T3 
RNA polymerase (MEGAscript kit, Ambion) to generate antisense cRNA transcripts of the 
cloned inserts containing known vector sequences abutting the Msp l and NotI sites from the 

15 original cDNAs. 

At this stage, each of the cRNA preparations was processed in a three-step fashion. In 
step one, 250 ng of cRNA was converted to first-strand cDNA using the 5* RT primer (A-G-G- 
T-C-G-A-C-G-G-T-A-T-C-G-G (SEQ ID NO:25). In step two, 400 pg of cDNA product was 
20 used as PGR template in four separate reactions with each of the four 5* PGR primers of the 

form G-G-T-C-G-A-C-G-G-T-A-T-C-G-G-N (SEQ ID NO:26), each paired with a "universal" 
3' PGR primer G-A-G-C-T-C-C-A-C-C-G-C-G-G-T (SEQ ID NO:27) to yield four sets of PGR 
reaction products CNl reaction products"). 

25 In step three, the product of each subpool was fiirther divided into 64 subsubpools (2ng 

in 20nl) for the second PGR reaction. This PGR reaction comprised adding 100 ng of the 
fluoresceinated "universal" 3' PGR primer (SEQ ID NO: 27) conjugated to 6-FAM and 100 ng 
of the appropriate 5' PGR primer of the form C-G-A-C-G-G-T-A-T-G-G-G-N-N-N-N (SEQ ID 
NO: 28), and using a program that included an annealing step at a temperature X slightly above 

30 the Tm of each 5* PGR primer to minimize arti factual mispriming and promote high fidelity 
copying. Each polymerase chain reaction step was performed in the presence of TaqStart 
antibody (Clonetech). 
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The products (''N4 reaction products") from the final polymerase chain reaction step for 
each of the tissue samples were resolved on a series of denaturing DNA sequencing gels using 
the automated ABI Prizm 377 sequencer. Data were collected using the GeneScan software 
package (ABI) and normaHzed for amplitude and migration. Complete execution of this series 
5 of reactions generated 64 product subpools for each of the four pools established by the 5" PGR 
primers of the first PGR reaction, for a total of 256 product subpools for the entire 5' PGR 
primer set of the second PGR reaction. 

The mRNA samples from each of the tumor cell lines as described above were analyzed. 

10 Table 2 is a summary of the expression levels of 1 1 36 mRNAs determined from cDNA. These 
cDNA molecules are identified by their digital address, that is, a partial 5' terminus nucleotide 
sequence coupled with the length of the molecule, as well as the relative amount of the 
molecule produced at different time intervals after treatment. The 5* terminus partial nucleotide 
sequence is determined by the recognition site for Mspl and the nucleotide sequence of the 

15 parsing bases of the 5' PGR primer used in the final PGR step. The digital length of the 

fragment was determined by interpolation on a standard curve, and as such, may vary plus or 
minus 1 or 2 base pairs from the actual length as determined by sequencing. 

For example, the entry in Table 2 that describes a DNA molecule identified by the 
20 digital address Mspl GAGG 1 97, is further characterized as having a 5' terminus partial 
nucleotide sequence of GGGGAGG and a digital address length of 197 b.p. The DNA 
molecule identified as Mspl GAGG 197 was further characterized as being expressed at the 
lowest level in cell line AT- 12, at higher levels in cell lines AT-7 and AT-4 and at the highest 
level in the cell line AT-1 3. 

25 

The data for Mspl GAGG 197 shown in Table 2 and Figure 1 were generated with a 5*- 
PCR primer (G-G-A-C-G-G-T-A-T-C-G-G-G-A-G-G, SEQ ID NO:29) paired with the 
"universal" 3' primer (SEQ ID NO: 27) labeled with 6-carboxyfluorescein (6FAM, ABI) at the 
5' terminus. PGR reaction products were resolved by gel electrophoresis on 4.5% acrylamide 
30 gels and fluorescence data acquired on ABI377 automated sequencers. Data were analyzed 
using GeneScan software (Perkin-Elmer). 

Similarly, the other DNA molecules identified in Table 2 by their Mspl digital addresses 
are further characterized by the pattern of the level of gene expression in the different cell lines. 
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Further examples of TOGA analysis are shown in Figures 3, 5, 7, and 9. Some PCR products, 
which were also differentially represented, appeared to migrate in positions that suggest that the 
products were novel based on comparison to data extracted from GenBank. Sequence 
alignments are shown for selected DSTs in Table 3. Table 3 summarizes the alignment 
5 information from a BLAST report generated upon comparing the cloned sequence to those in 
the GenBank/EST databases. In these cases, the PCR product is isolated, cloned into a TOPO 
vector (Invitrogen) and sequenced on both strands. In order to verify that the clones isolated are 
from the same peak, PCR primers are designed based on the determined sequence and PCR is 
performed using the cDNA produced in the first PCR reaction as substrate. Oligonucleotides 
10 were synthesized corresponding to the 5* PCR primer in the second PCR step extended at the 3' 
end with an additional 14 nucleotides from the cloned sequence. The primer sequences are 
shown in Table 4. 

TOGA analysis figures 1, 3, 5, and 7 are based on data from a single library. These 
15 experiments were replicated with a second library from the same source. Any differences 

between the length of the DST in the digital addresses in the single library versus the duplicate 
libraries are no more than two base pairs. The digital addresses reported in Table 2, the 
information in Tables 3 and 4, and Figures 9 and 10 are the product of an analysis that uses data 
obtained from both libraries. 

20 

Figure 3 is a graphical representation of the results of TOGA analysis using a 5' PCR 
primer with parsing bases GCTG, showing PCR products produced from mRNA extracted from 
(top to bottom panels) cell lines AT-4 (Fig. 3A), AT-7 (Fig. 3B), AT-12 (Fig. 3C), and AT-13 
(Fig. 3D); the arrow indicates the vertical cursor connecting the position of peaks of length 345 
25 base pairs. 

Another example is shown in Figure 5, which is a graphical representation of the results 
of TOGA analysis using a 5' PCR primer witfi parsing bases ACAT, showing PCR products 
produced from mRNA extracted from (top to bottom panels) cell lines AT-4 (Fig. 5A), AT-7 
30 (Fig. 5B), AT-12 (Fig. 5C), and AT-13 (Fig. 5D); the arrow indicates the vertical cursor 
connecting the position of peaks of length 367 base pairs. 

Figure 7 shows the results of another example of TOGA analysis, in this case using a 5' 
PCR primer with parsing bases ACGG, showing PCR products produced from mRNA extracted 
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from (top to bottom panels) cell lines AT-4 (Fig. 7A), AT-7 (Fig. 7B), AT*12 (Fig. 7C), and 
AT- 13 (Fig. 7D); the arrow indicates the vertical cursor connecting the position of peaks of 
length 458 base pairs. 

5 In one case (BARl l 1 ; SEQ ID NO:3), the DST sequence listed was cloned into a 

TOPO vector as described previously. Figure 10 illustrates the verification process, in which 
the cDNA produced in the first PGR reaction was used as template, and the PGR product 
obtained using an extended PGR primer designed based on the determined sequence was 
compared to products obtained using the original PGR primers. Oligonucleotides were 
1 0 synthesized corresponding to the 5' PGR primer in the second PGR step for each candidate 

extended at the 3' end with an additional 14 nucleotides from the cloned sequence. This clone 
specific 5' PGR primer (SEQ ID NO:32) was paired with a fluorescent labeled universal 3' 
PGR primer (SEQ ID NO:27) in PGR reactions using the cDNA produced in the first PGR 
reaction or substrate. Primers designed for such studies of cloned DSTs are shown in Table 4. 

15 

Figure 9 is a graphical representation of the results of TOGA analysis using a 5' PGR 
primer with parsing bases GGGT showing PGR products produced from mRNA extracted from 
cell lines AT-4 (Fig. 9A), AT-7 (Fig. 9B), AT- 12 (Fig, 9G), and AT- 13 (Fig. 9D); the arrow 
indicates the vertical cursor connecting the position of peaks of length 151 base pairs. The 

20 length of the PGR fragment amplified with an extended primer based on the cloned sequence 
was compared to the length of the original PGR product as shown in Figure 10. The upper 
panel (A) shows the extended primer reaction product created with a 5' PGR primer (SEQ ID 
NO:32) and the 3* PGR primer (SEQ ID NO:27). Figure lOB shows the PGR products 
produced using the original PGR primers. In the bottom panel (Figure lOG), the traces from the 

25 top and middle panels are overlaid, demonstrating that the PGR product produced using an 

extended primer based on the cloned sequence is the same length as the original PGR product. 

The map locations of several DSTs were determined from database information as 
follows: BAR1_20 (SEQ ID NO:21) maps to chromosome 13, BAR1_24 (SEQ ID NO:5) maps 
30 to chromosome 7; 62 cM, BAR1_27 (SEQ ID NO:l) maps to chromosome 14; 20.5 cM, and 
BAR1_28 (SEQ ID NO:4) maps to chromosome 1 5; 43.3 cM. 

The expression pattern of several DSTs has been assessed by Northem Blot analysis 
using RNA extracted from the above cell lines as well as other cell lines. RNA and multiple 
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northern blots have been prepared from various cell lines, including approximately 20 different 
tumor cell lines with mutations in Atm^p53 or combinations of Atm and p53. These cell lines 
have been characterized using flow cytometry and the majority are T-cell Ijmiphomas. These 
tumors are being characterized or have been characterized using SKY and FISH. 

5 

Several candidate genes have been chosen for evaluation based on their known 
localization to chromosomes that are abnormal in the specific and^^/w/p53 deficient 
tumors. Northem blots have been used to verify the gene expression patterns in the four lines 
subjected to TOGA. Candidate EST sequences were determined. EST sequences were used in 
10 BLAST searches to identify matching sequences in the GenBank database. These GenBank 
sequences were then used to generate probes as indicated. Selected examples are shown in 
Figures 2, 4, 6, and 8. 

We have chosen to study these cell lines because of the demonstrated correlation 
1 5 between chromosomal aberrations in our mouse model and those of the human disease. We 
chose several additional cell lines for northem analysis that include tumor, cell lines derived 
from mice with mutations in the tumor suppressor gene p53 as well as combinations of 
mutations of AT and p53. This will help us determine which genes are abnormally regulated 
due to loss of AT (cell lines labeled AT-#), p53 (p53-l)or combinations of both genes (APT-3 
20 is AT-/-p53+/- and 101-7, 292-3 are double mutants). We will pursue genes that are 

abnormally regulated due to loss of AT as this is the clinical phenotype we are most interested 
in and for which it appears the mouse model is an ideal for determining how abnormal 
regulation of genes give rise to human lymphoma and leukemia. 

25 Genes that are abnormally regulated among AT cell lines may be uniquely associated 

with AT tumorigenesis (as compared to other mechanisms of tumorigenesis) due to genetic 
abnormalities caused by AT deficiency. These may include phenomena such as translocations 
of lymphoid specific promoters or other regulatory sequences in or near genes promoting cell 
growth, or inactivation or deletion of genes that may regulate or suppress lymphoid tumor 
30 growth. Thus, the examples shown here were also chosen in part on the basis of their potential 
.^^ location in or near chromosomal aberrations among the AT cell lines studied. The genes 

identified in these examples may relate to specific biological aspects found both in the mouse 
model and in human AT tumorigenesis (e.g., the development of particularly aggressive forms 
of lymphomas). 
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Poly A enriched mRNA was extracted from four tumor cell lines, AT-4, AT-7, AT- 12, 
and AT- 13, as described above. The mRNA samples were electrophoresed through an agarose 
gel, blotted, and probed using well-known methods. Briefly, 20 (xg of total RNA or 2 |Lig of 
5 poly A+ mRNA was electrophoresed through a 1 .2% agarose gel containing formamide along 
with the appropriate molecular weight standards. The gel was blotted overnight using nylon 
membrane to transfer the RNA. The membrane was prehybridized for one hour at 42°C in 
hybridization buffer (5X SSPE, 5X Denhardt's solution, 50% formamide, 0.2% SDS, 100 jig/ml 
salmon sperm DNA, and water). The probe DNA (50ng) was labeled with ^^[P]-dCTP and 
10 [P]-dATP using asymmetric PCR labeling. The membrane was probed with radiolabeled 
DNA (2-5 X 1 0^ cpm/ ml) overnight at 42*^C in hybridization buffer. In addition, the northern 
blots were probed with radiolabeled cyclophylin DNA to normalize the amount of mRNA in 
each sample. Band intensities of the probed mRNA samples were quantitated using a 
Phosphoimager SI and normalized to the hybridization signal of cyclophilin. 

15 

Figure 2 is a graphical representation of the results of Northern Blot analysis using 
radiolabeled probe derived from a candidate gene (GenBank Accession Number U88908) 
mapping to mouse Chromosome 9, FISH mapped to band A2 (cytogenic) (see Liston, P., (1997) 
Genomics, 46(3):495-503) corresponding to GAGC 197, showing, in the image in the upper 
20 panel, hybridization with a band of mRNA at 4.2 kb that is expressed at various levels in cell 
lines AT-4, AT-7, AT-10, AT-11, AT-13, APT-3, P53-1, 101-7, 292-3 and thymus, but not 
detectable in cell line AT 12; the image in the lower panel shows the methylene blue stained gel 
for quantification of mRNA loading. 

25 Figure 4 is a graphical representation of the results of Northern Blot analysis using 

radiolabeled probe derived from a candidate gene (GenBank Accession Number Z31664) 
human homolog (ACVRLl- GenBank accession Z22533) maps to 12ql l-ql4, this region is 
sjnitenic to mouse chromosome 15 between 56.8 cM-64.0 cM, corresponding to GCTG 345, 
showing, in the image in the upper panel, hybridization with bands of mRNA at 4.0 kb and 3.7 

30 kb that are expressed at various levels in cell lines AT-4, AT-1 1, AT-12, AT-13, 292-3, and 

poly A^ (pA) thymus mRNA, but not detectable in cell lines AT-7, AT-10, APT-3, P53-1, 101- 
7; the image in the lower panel shows the methylene blue stained gel for quantification of 
mRNA loading. 
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Figure 6 is a graphical representation of the results of Northern Blot analysis using 
radiolabeled probe derived from a candidate gene (GenBank Accession Number U38252) 
mapping to mouse Chromosome 5, 65.0 cM corresponding to ACAT 367, showing, in the 
image in the upper panel, hybridization with a band of mRNA at 4. 1 kb that is expressed at 
5 various levels in cell lines AT-7, AT- 10, AT- 12, AT- 13 and APT-3, but not detectable in cell 
lines AT-4, AT-1 1, P53-1, 101-7, 292-3 and polyA (pA) thymus mRNA; the image in the lower 
panel shows the methylene blue stained gel for quantification of mRNA loading. 

Figure 8 is a graphical representation of the results of Northern Blot analysis using 
10 radiolabeled probe derived from a candidate gene (GenBank Accession Number X12822) 
mapping to mouse Chromosome 14, 20.5 cM corresponding to ACGG 458, showing, in the 
image in the upper panel, hybridization with bands of mRNA at 1 .7 kb and 1 . 1 kb that are 
expressed at various levels in cell lines AT-7 and AT- 10, but not detectable in cell lines AT-4, 
AT-12, AT-1 1, AT-13; APT-3, P53-1, 101-7, and 292-3, and polyA (pA) thymus (only 1.1 kb) 
1 5 mRNA; the image in the lower panel shows the methylene blue stained gel for quantification of 
mRNA loading. 

EXAMPLE 2 

Verification of Candidate Gene Targets Usinag FISH 

Once candidate genes are identified that show altered expression in the tumor cell lines, 
20 FISH is used to map the chromosomal location of the gene. Candidate genes that fall within the 
region that is rearranged in the particular cell line are identified as a gene whose change in 
expression is essential for tumorigenesis. 
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TABLE 2 


Seq ID 


Clone ID 


Digital Address 
(Mspl) 


AT4 


AT7 


AT12 


AT13 






A A A A 

AAAA 


1 yo 


45 


~T A 

74 


O A 

24 


57 






A A A A 

AAAA 




oo 

96 


A OO 

108 


O A 

81 


A n A 

184 






AAAA 

AAAA 


/I "70 

4/ o 


104 


39 


A tr A 

154 


IT O *4 

531 






A A A 

AAAU 




A A O 

1 12 


156 


A t\ A 

194 


79 






A A A 

AAAU 


o4o 


^o 

76 


195 


A O 

46 


14 






AAA 

AAAC3 


99 


54 


O «! 

61 


o A n 

948 


oo^ 

227 






AAA 

AAAG 


143 


oo 

32 


85 


22 


32 






AAA 

AAAG 


169 


92 


418 


1381 


421 






AAA 

AAAG 


183 


28 


56 


79 


58 






AAA 

AAAG 


215 


23 


60 


48 


64 






AAA 

AAAG 


225 


404 


167 


148 


123 






AAA /"^ 

AAAG 


231 


294 


132 


134 


110 






AAA £^ 

AAAG 


246 


140 


56 


48 


44 






AAA 

AAAG 


251 


100 


48 


36 


34 






AAA 

AAAG 


O f 

355 


98 


348 


312 


242 






A A ^> A 

AACA 


105 


47 


43 


69 


167 






AACA 


127 


175 


99 


356 


212 








182 


122 


19 


383 


60 






A A A 

AACA 


197 


52 


146 


55 


155 






A A A 

AACA 


201 


51 


78 


22 


72 






AACC 


235 


67 


37 


32 


18 






A A £^f^ 

AACC 


o oo 

323 


396 


328 


550 


108 






A A 

AACC 


o oo 

390 


55 


18 


14 


32 






AACC 


A O 44 

401 


56 


104 


18 


39 


6 


BAR1_16 


A A r^f^ 

AACG 


119 


2745 


448 


4106 


3652 






A A i^f^ 

AACG 


olo 


32 


54 


150 


42 






A A r^r^ 

AACG 


426 


120 


587 


306 


708 






A A r^r^ 
AACG 


4oU 


oo 

69 


565 


oo 

23 


408 






A A /^/^ 

AACG 


A A ^ 

445 


64 


22 


44 


12 






A A 

AAL> 1 


1^ / 


134 


140 


66 


170 






A A r^T 

AAO 1 


140 


oo 

96 


A A f\ 

110 


O O <4 

321 


A ^O 

153 






A A^^T 
AAL-f 1 




136/ 


O «1 0"7' 

2187 


1732 


270 






A A OX 
AAO 1 




384 


A A 

344 


136 


176 






AACT 




A A 

344 


302 


138 


A A*^ 

142 






AACT 


474 


<f 'I -4 
111 


oo 

28 


•i 4 o 

118 


108 






AAGA 


108 


^ 1 o 




1 


1 oo 






AAGA 


171 


94 


178 


50 


257 






AAGA 


234 


118 


44 


131 


129 






AAGA 


325 


38 


16 


50 


60 






AAGA 


333 


168 


226 


90 


238 






AAGC 


165 


178 


615 


334 


624 






AAGC 


300 


69 


47 


24 


36 






AAGG 


140 


425 


590 


198 


AAA 






AAGG 


251 


410 


582 


137 


432 
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/AAA VJVJJ 




47Q 


1 Rft 


o^o 


4ft4 










'-XKJ 


o^ 


1 9 


OO 












94 


"^O 
OU 


99 
ZZ 








*+\-/o 






'^O 

OU 


14 










ooo 


77fi 


7AO 
/ OU 


^ A O 






AATA 
MM 1 M 




ooy 


1 OOZ 


1 o*^ 
1 uo 


AOA 
OZO 






AATA 

MM 1 M 


1ft1 

1 O 1 


7R 


'^4 


1 1Q 

1 ly 


IZO 






AATA 

MM 1 M 


4Rn 


^^4 


'^O 
OU 


RO 
OU 


o«^ 

ZO 






AATA 

MM 1 M 


*t f o 


1 7*^ 


OOU 


OPO 
ZoU 


7>lO 
/ 4Z 






MM 1 M 


/1Q1 


oy 


zy 


4o 


OR 
ZD 






AATA 
MM 1 M 




OD 




A I 
4 1 


on 
ZU 






AATn 
MM 1 


01 R 


OZ 1 


1 0Oft 
1 UUo 


1/4 


/ Uo 






A ATr^ 
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We claim: 

1 . An isolated nucleic acid molecule comprising a polynucleotide chosen from the 
5 group consisting of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID 
NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID 
NO:ll, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:l4, SEQ ID NO:15, SEQ ID NO:16, SEQ 
ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, 
andSEQIDNO:23. 

10 2. An isolated polypeptide encoded by a polynucleotide chosen from the group 

consisting of SEQ ID NO: 1 , SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ 
ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO:l 1, SEQ 
ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 1 7, 
SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, and SEQ ID 

15 NO:23. 

3. An isolated nucleic acid molecule comprising a polynucleotide at least 95% 
identical to the isolated nucleic acid molecule of claim 1. 

4. An isolated nucleic acid molecule at least ten bases in length that is hybridizable 
to the isolated nucleic acid molecule of claim 1 under stringent conditions. 

20 5. An isolated nucleic acid molecule encoding the polypeptide of claim 2. 

6. An isolated nucleic acid molecule encoding a fragment of the polypeptide of 
claim 2. 

7. An isolated nucleic acid molecule encoding a polypeptide epitope of the 
polypeptide of claim 2. 

25 8. The polypeptide of claim 2 wherein the polypeptide has biological activity. 

9. An isolated nucleic acid encodihg a species homologue of the polypeptide of 
claim 2. 

10. The isolated nucleic acid molecule of claim 1, wherein the nucleotide sequence 
comprises sequential nucleotide deletions from either the 5 '-terminus or the 3 '-terminus. 

30 1 1 . A recombinant vector comprising the isolated nucleic acid molecule of claim I . 

12. A recombinant host cell comprising the isolated nucleic acid molecule of claim 



1. 



13. A method of making the recombinant host cell of claim 12. 

14. The recombinant host cell of claim 12 comprising vector sequences. 
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15. The isolated polypeptide of claim 2, wherein the isolated polypeptide comprises 
sequential amino acid deletions from either the C-terminus or the N-terminus. 

1 6. An isolated antibody that binds specifically to the isolated polypeptide of claim 

2. 

5 1 7. The isolated antibody of claim 16 wherein the antibody is a monoclonal 

antibody. 

18. The isolated antibody of claim 16 wherein the antibody is a polyclonal antibody. 

19. A recombinant host cell that expresses the isolated polypeptide of claim 2. 

20. An isolated polypeptide produced by the steps of: 

10 (a) culturing the recombinant host cell of claim 14 under conditions such that 

said polypeptide is expressed; and 

(b) isolating the polypeptide. 

21. A method for preventing, treating, modulating, or ameliorating a medical 
condition, comprising administering to a mammalian subject a therapeutically effective 

15 amount of the polypeptide of claim 2 or the polynucleotide of claim 1 . 

22. The method of claim 21 wherein the medical condition is ataxia telangiectasia. 

23. A method for preventing, treating, modulating, or ameliorating a medical 
condition comprising administering to a mammalian subject a therapeutically effective 
amount of the antibody of claim 16. 

20 24. The method of claim 23 wherein the medical condition is ataxia telangiectasia. 

25. A method of diagnosing a pathological condition or a susceptibility to a 
pathological condition in a subject comprising: 

(a) determining the presence or absence of a mutation in the polynucleotide 

of claim 1 ; and 

25 (b) diagnosing a pathological condition or a susceptibility to a pathological 

condition based on the presence or absence of said mutation. 

26. The method of claim 25 wherein the pathological condition is ataxia 
telangiectasia. 

27. A method of diagnosing a pathological condition or a susceptibility to a 

30 pathological condition in a subject comprising detecting an alteration in expression of a 

polypeptide encoded by the polynucleotide of claim 1, wherein the presence of an 
alteration in expression of the polypeptide is indicative of the pathological condition or 
susceptibility to the pathological condition. 
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28. The method of claim 27 wherein the alteration in expression is an increase in the 
amount of expression or a decrease in the amount of expression. 

29. The method of claim 27 wherein the pathological condition is ataxia 
telangiectasia. 

5 30. The method of claim 29 wherein the method further comprises the steps of: 

obtaining a first biological sample from a patient suspected of having ataxia 
telangiectasia and obtaining a second sample from a suitable comparable control source; 
(a) determining the amount of at least one polypeptide encoded by a 
polynucleotide of claim 1 in the first and second sample; and 
10 (b) comparing the amount of the polypeptide in the first and second samples; 

wherein a patient is diagnosed as having ataxia telangiectasia if the amount of 
the polypeptide in the first sample is greater than or less than the amount of the 
polypeptide in the second sample. 

3 1 . The use of the polynucleotide of claim 1 or polypeptide of claim 2 for the 
15 manufacture of a medicament for the treatment of ataxia telangiectasia. 

32. The use of the antibody of claim 1 6 for the manufacture of a medicament for the 
treatment of ataxia telangiectasia. 

33. A method for identifying a binding partner to the polypeptide of claim 2 
comprising: 

20 (a) contacting the polypeptide of claim 2 with a binding partner; and 

(b) determining whether the binding partner effects an activity of the polypeptide. 

35. The gene corresponding to the cDNA sequence of the isolated nucleic acid of 
claim 1. 

36. A method of identifying an activity of an expressed polypeptide in a biological 
25 assay, wherein the method comprises: 

(a) expressing the polypeptide of claim 2 in a cell; 

(b) isolating the expressed polypeptide; 

(c) testing the expressed polypeptide for an activity in a biological assay; and 

(d) identifying the activity of the expressed polypeptide based on the test results. 
30 37. A substantially pure isolated DN A molecule suitable for use as a probe for genes 

regulated in ataxia telangiectasia, chosen from the group consisting of SEQ ID NO: 1 , 
SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID 
NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:ll, SEQ ID NO:12, 
SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ 
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ID NO: 1 8, SEQ ID NO: 1 9, SEQ ID NO:20, SEQ ID NO:2 1 , SEQ ID NO:22, and SEQ 
ID NO:23. 

38. A kit for detecting the presence of the polypeptide of the claim 2 in a 
mammalian tissue sample comprising a first antibody which immunoreacts with a mammalian 

5 protein encoded by a gene corresponding to the polynucleotide of claim 1 or with a polypeptide 
encoded by the polynucleotide of claim 2 in an amount sufficient for at least one assay and 
suitable packaging material. 

39. A kit of claim 38 fiirther comprising a second antibody that binds to the first 
antibody. 

1 0 40. The kit of claim 39 wherein the second antibody is labeled. 

41 . The kit of claim 40 wherein the label comprises enzymes, radioisotopes, 
fluorescent compounds, colloidal metals, chemiluminescent compounds, phosphorescent 
compounds, or bioluminescent compounds, 

42. A kit for detecting the presence of a genes encoding an protein comprising a 

15 polynucleotide of claim 1, or fi-agment thereof having at least 10 contiguous bases, in an 

amount sufficient for at least one assay, and suitable packaging material. 

43. A method for detecting the presence of a nucleic acid encoding a protein in a 
mammalian tissue sample, comprising the steps of: 

(a) hybridizing a polynucleotide of claim 1 or fi-agment thereof having at least 10 
20 contiguous bases, with the nucleic acid of the sample; and 

(b) detecting the presence of the hybridization product. 
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SEQUENCE LISTING 

<110> Barlow, Carrol ee 

Win row, Christopher 3 
Callahan, Marie Lei A 
Pankratz, Daniel G 

<120> Gene Expression Modulated in Ataxia Telangiectasia Tumorigenesis 

<130> 99-529B 

<150> 60/149,868 
<151> 1999-08-19 

<160> 52 

<170> Patentin version 3.1 

<210> 1 

<211> 406 

<212> DNA 

<213> Mus musculus 

<400> 1 



cggacgggga 


attcccaaaa 


acactgcacg 


aagttaagct 


gacagtacag 


aaggatcagg 


60 


tgtgtgagtc 


ccagttccaa 


agttcttaca 


acagagctaa tgagatatgt 


gtgggagact 


120 


caaagatcaa 


gggagcttcc 


tttgaggagg 


attctggagg 


cccgcttgtg 


tgtaaaagag 


180 


cagctgcagg 


catcgtctcc 


tacgggcaaa 


ctgatggatc 


agctccacaa 


gtcttcacaa 


240 


gagttttggg 


ttttgtatcg 


tggataaaga 


aaacgatgaa 


acacagctaa 


ctacaagaag 


300 


caaactagat 


cctgatctga 


ccagccatct 


tccccatagc 


tgagtccagg 


attgctctag 


360 


gacagatggc 


agcaactgaa 


taaagaactt 


tctctgactc 


caaaaa 




406 


<210> 2 
<211> 54 
<212> DNA 
<213> Mus 


musculus 












<400> 2 
cggcagaggc 


tcctggcctg 


cacttttcct 


attaaagtca 


ctgacacaca 


aaaa 


54 


<210> 3 
<211> 101 
<212> DNA 
<213> MUS 


musculus 












<400> 3 
cggccgtgtg 


tgccttagga 


gtgggaaact 


ttgcatttct 


ctctccttat 


ccttcttgta 


60 


agacatccat 


ttaataaagt 


ctcatgctga 


gagagtaaaa 


a 




101 



<210> 4 
<211> 258 



Page 1 



<212> DNA 

<213> Mus musculus 

<400> 4 

cggctcgcag agtgaaggca cccagccatc aggaatgcag ccctgccagc caggcaccat 
gggtggcagc catcatgctt tcatgaccat tgggcctgct ggtctacctg gtcttagccc 
aggaagccac caggtaggtt agggtggtca gtgccgagtc tcctgccaga cacagttata 
cctgcctttc tgcactgctc caggcatgcc cttagagcat ggtgttttaa agctaaataa 
agtctctaac tacaaaaa 



<210> 5 

<211> 116 

<212> DNA 

<213> Mus musculus 

<400> 5 

cgggctcagc actgtctgct cacgtcccca ctgttgcaac acctagtctg tttgcaagga 
ggacaggaca ggtcacatgg agctttatgt caataaagtc tttatcttgt caaaaa 



<210> 6 

<211> 68 

<212> DNA 

<213> Mus musculus 

<400> 6 

cggaacgggg gatgatgggg gattggaaga acttgaactt tttattaaaa tattgtaagt 
cctaaaaa 



<210> 7 

<211> 172 

<212> DNA 

<213> Mus musculus 

<400> 7 

cggaccgcca tgcctggtga gcatatccat cctttgaaag atacactcat gtcctaaacg 
gagctgtgac tgagatttag tgttttcaat atgaacaaaa agcatttgcc tagcaaatta 
agttcaaatg aaggcttagt cttttgaaga ataaagcata aaaccatcaa aa 



<210> 8 

<211> 151 

<212> DNA 

<213> Mus musculus 

<400> 8 

cggacgctgg agccttcctt aatccacacc tttgaagcca attctcaact ggttgccgtg 
tacactgatt acatttcagg aggacgtgtt ctatttatgt tgtgcctctg caggcaaagc 
ccttaataaa tattttgcat actttcaaaa a 
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<210> 9 

<211> 300 

<212> DNA 

<213> MUS musculus 

<400> 9 



cggacggtgt 


accccgagga 


tcgccccagg 


tggagggaaa 


gatccaggac 


caggtcgcgc 


DU 


agcaggagta 


gaaccccatt 


tcgcctgtgt 


gagaaagatc 


gaatggagct 


actagaaata 


120 


gcaaaagcca 


acgcagcaaa 


agctctggga 


acagccaact 


tcgacttgcc 


agcaagtctc 


180 


cgagccaagg 


aggcaagcca 


ggggacagct 


gtttccagca 


gtgggccaaa 


ggtggagcat 


240 


tcagaaaagc 


agactgaaga 


tacaactaaa 


aataccagtg 


aaaagtcttc 


tacacaaaaa 


300 


<210> 10 
<211> 43 
<212> DNA 
<213> MUS 


musculus 












<400> 10 
cggagtcccc 


tcctctgaat 


taaaatcaca 


ctgagcggaa 


aaa 




43 


<210> 11 
<211> 400 
<212> DNA 
<213> MUS 


musculus 












<400> 11 
cggcatcgtc 


cttgttaagc 


actgctttcc 


aqaccqtqtc 


tgacttagtc 


aacctagagg 


60 


attttatagt 


ttcactctaa tgaagtcctg 


attaacacag 


agcagcggag 


cagcggtccc 


120 


tcccctgagt 


ggcagcagca 


ggaagccccg 


tttgtaatgt 


gcgttgcgct 


gctttaccgt 


180 


acacccagtg 


gcattctttt 


tactataacg 


gttttccccc 


ctctcagaag 


agaaatctta 


240 


aaatttgcaa cagacctcct 


aacttttgtt 


tacttttttt 


gtcttaacga tagatttgaa 


300 


aatgaaaaat 


gttaaaaaga caactggatc 


tgttgtccgt 


attatatttg 


caatctgaaa 


360 


tttgtgtgaa ttgatttagt 


aaaatgttaa 


actggaaaaa 






400 


<210> 12 
<211> 219 
<212> DNA 
<213> MUS 


musculus 












<400> 12 
cggcctggac 


cacatcacgg 


aggaagaact 


gaaacggttc 


aagtactttg 


ctttgactga 


60 


gtttcagatt 


gccaggagca 


cactcgacgt 


cgcagatagg 


acagaattag 


ctgaccacct 


120 


gattcaaagt 


gcatgtgcgg 


cgtctggagt 


gaccaaggcc 


attaatattt 


tccagaagtt 


180 
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gaattatatg catattgcaa atgctcttga agagaaaaa 



219 



<210> 13 

<211> 89 

<212> DNA 

<213> Mus musculus 

<400> 13 ^r. 

cgggctcttt gcacagttct aacctataca acttggtctg ctcctcctac caacttattt 60 
aatagacatt aaagtaaaac gcccaaaaa 89 

<210> 14 

<211> 42 

<212> DNA 

<213> Mus musculus 

<400> 14 

cggctatcag tgctaacaga aaacattgta gaaggccaaa aa 42 

<210> 15 

<211> 247 

<212> DNA 

<213> MUS musculus 

<400> 15 

cggaccatcc ctgctctgtg tccatctact catccaatgg agtcattagc agtcaatcgc bO 

agcttcactg gacactgagg ggtcacagac ttaggctccc tttgggtcac gtccagcgtg 120 

tcctagactt tatcatcttt cagaggcggt aggcagactg ttcacaaagg ctttctctag 180 

ctttccactg caattaatct tggtcactcc ctcaaataag tatatttttt ctagaaaagg 
ggaaaaa 



240 
247 



<210> 16 

<211> 260 

<212> DNA 

<213> Mus musculus 

<400> 16 

cggaccctga ttcgcatcat ggtgtctcgc ggcgagcttg acctgctgga tatccgagca bO 

gagtataagc ggatgtacgg caagtcactg taccacgata tcacgggaga tacttctgga 120 

gactatcgaa agattccact gaagatctgt ggtggcaacg actgagctgt ggcagtggct 180 

cacctgtgcc ccctgctggc attggccagg aaacgtccca aagagcacac ttcggtttcc 240 
ctatctaata accccaaaaa 

<210> 17 

<211> 209 

<212> DNA 

<213> MUS musculus 



260 



Page 4 



<400> 17 

cggaccgtgc aaaggtagca taatcacttg ttccttaatt agggactagc atgaacggct 
aaacgagggt ccaactgtct cttatcttta atcagtgaaa ttgacctttc agtgaagagg 
ctgaaatata ataataagac gagaagaccc tatggggctt aaattatata acttatctat 
ttaatttatt aaacctaatg gcccaaaaa 

<210> 18 

<211> 385 

<212> DNA 

<213> Mus musculus 

<400> 18 

cggagtggac agcacaagcc atggaagcgc agatggccca gtgagccctc agcgctgtgc 
gcactgcatc tttgccactt cgtgcagatt gattcactgc ccctcaaagc atttgcatca 
tgaccttaga catctcagcc tcctttgctg gatcccactt gaagaagatg ctagagcaag 
aagtgcatcg tttaaacata atagctcccg tgagatctgg gtctgtgtat tgaactaact 
cagacgcttt gaaaacttgt tctttaagtg gaagtgaaag acctgaagtt tgggatttgt 
gatcactttg tagtcatgta aaacttaagt gctttgtgcc tctccacatg tggttactct 
aataaacgga gcggtgagcc aaaaa 

<210> 19 

<211> 83 

<212> DNA 

<213> Mus musculus 

<400> 19 

cggccgaggc gaggcgccgc gcggaaaacc gcggcgggtg ttgacgcgat gtgatttctg 
cccagtgctc tgaatgtcaa aaa 

<210> 20 

<211> 286 

<212> DNA 

<213> Mus musculus 

<400> 20 

cgggggccaa gacgaggagg tgctgatgat gaacggtcat cctggcggaa tgctgatgat 
gataggggtc ccaggagagg catggatgat gataggggtc ccaggcgagg gctggatgat 
gaccgaggac cttggaggaa tgctgctgaa gataggattt ccaggagagg tgcagatgat 
gaccgagggc cttggagaaa tatggatgat gatagggttc ctagaagggg tgatgatgca 
agacctggtc cttggagatc atttgtcgag ccaggtggat gaaaaa 

<210> 21 
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<211> 124 
<212> DNA 
<213> Mus musculus 

<400> 21 

cggtaccaat gtatagctgt gttggagaaa tatggaagat aaactataca ggatatttgc 60 

acattctcat ctgtctctct ggtgaccgat aattcagttt gctccagctg cccaagccca 120 

aaaa 124 



<210> 22 

<211> 68 

<212> DNA 

<213> Mus musculus 

<400> 22 

cggtagcggg aacaccgaga acgaagccag cctggctttc gaaaacgcct gggtttgctg 
gagaaaaa 



60 
68 



<210> 23 

<211> 160 

<212> DNA 

<213> Mus musculus 

<400> 23 

cggtgcagca ccgtgaggtc cagtaccacg agtccgacac cttccgtggc tacttcaagc 
agggcatcat ctataaaaag gggggtgtgg cctctgggat gaagcacgtg gagaccaaca 
cctatgatgt gaagaggctg ctacacgtaa aggggaaaaa 



60 
120 
160 



<210> 
<211> 
<212> 
<213> 

<220> 
<223> 

<220> 
<221> 
<222> 
<223> 



24 
48 
DNA 

Arti f i ci al sequence 



Description of Artificial sequence: synthetic primer 



misc_feature 
(47) . . (48) 

N stands for any nucleotide. 



<220> 

<221> misc_feature 

<222> (46).. (46) 

<223> V stands for A, G, or C. 



<400> 24 

gaattcaact ggaagcggcc gcaggaattt tttttttttt tttttvnn 



48 



<210> 25 
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<211> 16 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: synthetic primer 

<400> 25 
aggtcgacgg tatcgg 

<210> 26 

<211> 16 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic primer 
<220> 

<221> misc_feature 

<222> (16).. (16) 

<223> N stands for any nucleotide. 

<400> 26 
ggtcgacggt atcggn 

<210> 27 

<211> 15 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence: synthetic primer 

<400> 27 
gagctccacc gcggt 

<210> 28 

<211> 16 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial sequence: synthetic primer 

<220> 

<221> misc_feature 

<222> (13).. (16) 

<223> N stands for any nucleotide. 

<400> 28 
cgacggtatc ggnnnn 

<210> 29 

<211> 16 
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<212> DNA 

<213> Artificial sequence 

<220> 

<223> Description of Artificial Sequence: synthetic primer 
<400> 29 

cgacggtatc gggagc 16 

<210> 30 
<211> 30 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Extended TOGA primer for BARl 
_7. 

<400> 30 

gatcgaatcc ggacggtgta ccccgaggat 30 

<210> 31 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Extended TOGA primer for barI 

<400> 31 

gatcgaatcc gggctctttg cacagttcta 30 



<210> 32 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence: Extended TOGA primer for BARl 
^11. 

<400> 32 

gatcgaatcc ggccgtgtgt gccttaggag 30 



<210> 33 

<211> 30 

<212> DNA 

<213> Arti f i ci al sequence 

<220> 

<223> Description of Artificial Sequence: Extended toga primer for BARl 
_16. 

<400> 33 

gatcgaatcc ggaacggggg atgatggggg 30 
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<210> 34 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Extended TOGA primer for BARl 
^20. 

<400> 34 

gatcgaatcc ggtaccaatg tatagctgtg 30 

<210> 35 

<211> 30 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Extended TOGA primer for BARl 
-21. 

<400> 35 

gatcgaatcc ggcagaggct cctggcctgc 30 

<210> 36 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Extended TOGA primer for BARl 
_22 . 

<400> 36 

gatcgaatcc ggctatcagt gctaacagaa 30 

<210> 37 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Extended TOGA primer for BARl 
-24. 

<400> 37 , 

gatcgaatcc gggctcagca ctgtctgctc 30 

<210> 38 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence: Extended TOGA primer for BARl 
-27. 
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<400> 38 

gatcgaatcc ggacggggaa ttcccaaaaa 



<210> 39 
<211> 30 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial sequence: Extended TOGA primer for 
-28. 

<400> 39 

gatcgaatcc ggctcgcaga gtgaaggcac 

<210> 40 

<211> 30 

<212> DNA 

<213> Artificial Sequence 

<220> ^ ^ jz 

<223> Description of Artificial Sequence: Extended TOGA primer tor 

_29. 
<400> 40 

gatcgaatcc ggagtcccct cctctgaatt 



<210> 41 

<211> 30 

<212> DNA 

<213> Artificial Sequence 

<220> . , . r: 

<223> Description of Artificial Sequence: Extended TOGA primer tor 
^34. 

<400> 41 

gatcgaatcc ggaccatccc tgctctgtgt 

<210> 42 

<211> 30 

<212> DNA 

<213> Artificial Sequence 

<220> . r: 

<223> Description of Artificial sequence: Extended TOGA primer for 
_35. 

<400> 42 

gatcgaatcc ggaccctgat tcgcatcatg 

<210> 43 

<211> 30 

<212> DNA 

<213> Artificial sequence 
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<220> 

<223> Description of Artificial Sequence: Extended TOGA primer for BARl 
_37. 

<400> 43 

gatcgaatcc ggaccgccat gcctggtgag 30 

<210> 44 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence: Extended TOGA primer for barI 
_38. 

<400> 44 

gatcgaatcc ggaccgtgca aaggtagcat 30 

<210> 45 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Extended toga primer for BARl 
-39. 

<400> 45 

gatcgaatcc ggacgctgga gccttcctta 30 

<210> 46 

<211> 30 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Extended toga primer for BARl 
^41. 

<400> 46 

gatcgaatcc ggagtggaca gcacaagcca 30 

<210> 47 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Extended TOGA primer for BARl 
^49. 

<400> 47 

gatcgaatcc ggcatcgtcc ttgttaagca 30 
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<210> 48 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Extended TOGA primer for BARl 

<400> 48 

gatcgaatcc ggccgaggcg aggcgccgcg 30 

<210> 49 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Extended toga primer for BARl 
-52. 

<400> 49 

gatcgaatcc ggcctggacc acatcacgga 30 

<210> 50 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Extended TOGA primer for BARl 
-62 . 

<400> 50 

gatcgaatcc gggggccaag acgaggaggt 30 

<210> 51 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence: Extended TOGA primer for BARl 
-67. 

<400> 51 

gatcgaatcc ggtagcggga acaccgagaa 30 

<210> 52 

<211> 30 

<212> DNA 

<213> Artificial Sequence 

<220> ' , 

<223> Description of Artificial Sequence: Extended TOGA primer for BARl 
-69. 
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<400> 52 

gatcgaatcc ggtgcagcac cgtgaggtcc 



30 
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